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ANHA Series Preface 


The Applied and Numerical Harmonic Analysis (ANHA) book series aims to 
provide the engineering, mathematical, and scientific communities with sig- 
nificant developments in harmonic analysis, ranging from abstract harmonic 
analysis to basic applications. The title of the series reflects the importance 
of applications and numerical implementation, but richness and relevance of 
applications and implementation depend fundamentally on the structure and 
depth of theoretical underpinnings. Thus, from our point of view, the inter- 
leaving of theory and applications and their creative symbiotic evolution is 
axiomatic. 

Harmonic analysis is a wellspring of ideas and applicability that has flour- 
ished, developed, and deepened over time within many disciplines and by 
means of creative cross-fertilization with diverse areas. The intricate and fun- 
damental relationship between harmonic analysis and fields such as signal 
processing, partial differential equations (PDEs), and image processing is re- 
flected in our state-of-the-art ANHA series. 

Our vision of modern harmonic analysis includes mathematical areas such 
as wavelet theory, Banach algebras, classical Fourier analysis, time-frequency 
analysis, and fractal geometry, as well as the diverse topics that impinge on 
them. 

For example, wavelet theory can be considered an appropriate tool to 
deal with some basic problems in digital signal processing, speech and image 
processing, geophysics, pattern recognition, biomedical engineering, and tur- 
bulence. These areas implement the latest technology from sampling methods 
on surfaces to fast algorithms and computer vision methods. The underlying 
mathematics of wavelet theory depends not only on classical Fourier analysis, 
but also on ideas from abstract harmonic analysis, including von Neumann 
algebras and the affine group. This leads to a study of the Heisenberg group 
and its relationship to Gabor systems, and of the metaplectic group for a 
meaningful interaction of signal decomposition methods. The unifying influ- 
ence of wavelet theory in the aforementioned topics illustrates the justification 
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for providing a means for centralizing and disseminating information from the 
broader, but still focused, area of harmonic analysis. This will be a key role 
of ANHA. We intend to publish with the scope and interaction that such a 
host of issues demands. 

Along with our commitment to publish mathematically significant works at 
the frontiers of harmonic analysis, we have a comparably strong commitment 
to publish major advances in the following applicable topics in which harmonic 
analysis plays a substantial role: 


Antenna theory Prediction theory 
Biomedical signal processing Radar applications 

Digital signal processing Sampling theory 
Fast algorithms Spectral estimation 

Gabor theory and applications Speech processing 
Image processing Time-frequency and 
Numerical partial differential equations time-scale analysis 

Wavelet theory 


The above point of view for the ANHA book series is inspired by the 
history of Fourier analysis itself, whose tentacles reach into so many fields. 

In the last two centuries Fourier analysis has had a major impact on the 
development of mathematics, on the understanding of many engineering and 
scientific phenomena, and on the solution of some of the most important prob- 
lems in mathematics and the sciences. Historically, Fourier series were devel- 
oped in the analysis of some of the classical PDEs of mathematical physics; 
these series were used to solve such equations. In order to understand Fourier 
series and the kinds of solutions they could represent, some of the most basic 
notions of analysis were defined, e.g., the concept of “function.” Since the 
coefficients of Fourier series are integrals, it is no surprise that Riemann inte- 
grals were conceived to deal with uniqueness properties of trigonometric series. 
Cantor’s set theory was also developed because of such uniqueness questions. 

A basic problem in Fourier analysis is to show how complicated phenom- 
ena, such as sound waves, can be described in terms of elementary harmonics. 
There are two aspects of this problem: first, to find, or even define properly, 
the harmonics or spectrum of a given phenomenon, e.g., the spectroscopy 
problem in optics; second, to determine which phenomena can be constructed 
from given classes of harmonics, as done, for example, by the mechanical syn- 
thesizers in tidal analysis. 

Fourier analysis is also the natural setting for many other problems in 
engineering, mathematics, and the sciences. For example, Wiener’s Tauberian 
theorem in Fourier analysis not only characterizes the behavior of the prime 
numbers, but also provides the proper notion of spectrum for phenomena such 
as white light; this latter process leads to the Fourier analysis associated with 
correlation functions in filtering and prediction problems, and these problems, 
in turn, deal naturally with Hardy spaces in the theory of complex variables. 
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Nowadays, some of the theory of PDEs has given way to the study of 
Fourier integral operators. Problems in antenna theory are studied in terms 
of unimodular trigonometric polynomials. Applications of Fourier analysis 
abound in signal processing, whether with the Fast Fourier transform (FFT), 
or filter design, or the adaptive modeling inherent in time-frequency-scale 
methods such as wavelet theory. The coherent states of mathematical physics 
are translated and modulated Fourier transforms, and these are used, in con- 
junction with the uncertainty principle, for dealing with signal reconstruction 
in communications theory. We are back to the raison d’étre of the ANHA 
series! 


John J. Benedetto 
Series Editor 
University of Maryland 
College Park 


Preface 


Seven years have passed since we finished editing the first book on discrete to- 
mography: Discrete Tomography: Foundations, Algorithms, and Applications 
(Birkhauser, Boston, 1999). There has been a flowering of the field since that 
time. New research groups have started, new theoretical and practical results 
have been presented, and new applications have developed. There have been 
about 200 papers published on discrete tomography since 1999. 

The current book reports on present advances in discrete tomography. Its 
structure is the same as that of the previous one: after an introduction (Chap- 
ter 1) there are chapters on new theoretical foundations (Chapters 2-7), re- 
construction algorithms (Chapters 8-11), and selected applications (Chapters 
12-16). The level of presentation aims at a potential readership of mathemati- 
cians, programmers, engineers, researchers working in the application areas, 
and students in applied mathematics, computer imaging, biomedical imaging, 
computer engineering, and/or image processing. 
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Introduction 


A. Kuba and G.T. Herman 


Summary. This chapter discuses the excellent progress made in discrete tomog- 
raphy (DT) during the last seven years and includes a comprehensive bibliography 
illustrating this progress. It also presents some of the fundamental definitions rele- 
vant to DT. 


1.1 Discrete Tomography since 1999 


In the historical overview of discrete tomography (DT) [123] that was pre- 
sented in the predecessor [106] of this book, the earliest reference is to an 1838 
paper of J. Steiner. This is because much work that is now considered relevant 
to DT had been done, in mathematical analysis, combinatorics, and geome- 
try, even before the idea of DT occurred. The first papers explicitly dealing 
with DT, in the sense that their titles contained the phrase “reconstruction of 
binary patterns,” appeared in the early 1970s (e.g., by S.K. Chang and G.T. 
Herman). However, it was not until 1994 that a scientific meeting devoted to 
DT took place. That year L. Shepp organized the DIMACS Mini-Symposium 
on Discrete Tomography, where the participants discussed problems related 
mostly to the reconstruction of finite lattice sets from their projections taken 
along straight lines. In the following five years, DT had an explosive develop- 
ment; several workshops were held and many relevant papers were published. 
The first book devoted to DT appeared in 1999 [106] and contained a col- 
lection of papers on DT foundations, algorithms, and applications, based on 
earlier presentations at a DT workshop. Its first chapter [123] provided an 
overview of DT up to the time of its publication. In the current book we do 
not repeat material that already appeared [106] but concentrate on advances 
that have taken place since 1999. 

During this period, several meetings devoted entirely to DT have been 
organized: 


(a) Meeting on the Mathematics of Discrete Tomography, December 3-9, 2000, 
Oberwolfach, Germany (organizers: P. Gritzmann and R.J. Gardner). 


2 A. Kuba and G.T. Herman 


(b) Workshop on Discrete Tomography: Algorithms and Applications, 2001, 
Certosa di Pontignano, Italy (organizers: A. Del Lungo, P. Gronchi, and 
G.T. Herman). 

(c) Workshop on Discrete Tomography and Its Applications, June 13-15, 
2005, New York City, USA (organizers: G.T. Herman and A. Kuba). 


Furthermore, there were meetings, such as the International Conferences 
on Discrete Geometry for Computer Imagery (DGCI 2000, 2002, 2003, 2005, 
2006) and the International Workshops on Combinatorial Image Analysis (IW- 
CIA 2001, 2003, 2004, 2005) at which many papers on DT were presented, 
sometimes in special sessions devoted to this topic. Special issues of journals 
were also devoted to DT: 


(a) Linear Algebra and Its Applications 339, 2001 (special issue editors: A. 
Del Lungo, P. Gronchi, and G.T. Herman). 

(b) Electronic Notes on Discrete Mathematics 12, 2005 (special issue editors: 
G.T. Herman and A. Kuba). 


Due to all this activity, as well as an impressive number of papers that 
have been published on DT independently of the above-mentioned workshops 
and special issues, we find that the last seven years have produced in the 
order of 200 publications [1-191] relevant to our topic. The current book is 
the culmination of some of this activity. Its chapters were invited by the edi- 
tors based on the presentations that were given at the Workshop on Discrete 
'Tomography and Its Applications, June 13-15, 2005, in New York City. They 
were refereed, revised, and edited with the aim of producing between them 
a cohesive coverage of some of the important advances in D'T since 1999. 
'The book is divided into three parts: Foundations of DT; D'T Reconstruction 
Algorithms; and Applications of DT. 


1.2 Definitions 


The introductory chapter [123] of [106] starts with the following definition of 
DT: 


We assume that there is a domain, which may itself be discrete (such 
as a set of ordered pairs of integers) or continuous (such as Euclidean 
space). We further assume that there is an unknown function f whose 
range is known to be a given discrete set (usually of real numbers). 
'The problems of discrete tomography, as we perceive the field, have to 
do with determining f (perhaps only partially, perhaps only approxi- 
mately) from weighted sums over subsets of its domain in the discrete 
case and from weighted integrals over subspaces of its domain in the 
continuous case. 
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This definition is much more general in scope than how DT had been con- 
sidered by some practitioners, namely that it focuses on subsets of the integer 
lattice. Our definition includes, for example, problems involving the recovery 
of planar convex sets from their integrals. Since neither the statements nor the 
solution of such problems involve much discrete mathematics, the definition is 
debatable. Of course, the word “discrete” in the definition is not referring to 
the nature of the methods to be used, but rather to the nature of the range of 
the function to be recovered. Approached this way, planar convex sets are rep- 
resented by their characteristic functions, which are of course binary-valued. 
As we explain below, the tomography of measurable sets and the tomography 
of finite sets overlap somewhat, and we find it convenient to bring both under 
the same heading. 

In fact, the tomography of planar measurable sets is subsumed under the 
already established field of geometric tomography. This term was introduced 
by R.J. Gardner at the 1990 Oberwolfach meeting on tomography and is de- 
fined in his book [90] as the area of mathematics dealing with the retrieval of 
information about a geometric object from data about its sections, or projec- 
tions, or both. The term “projection” here means orthogonal projection, and 
according to Gardner, the phrase “geometric object” is deliberately vague: 
a convex polytope or body would certainly qualify, as would a star-shaped 
body, or even, when appropriate, a compact set or measurable set. 

Clearly, then, the tomography of measurable sets falls under both DT, as 
we define it, and geometric tomography. Researchers should be aware of this 
and choose whichever term they feel more suitable as a general description. 

It is a characteristic property of all young fields of research that the ter- 
minology is not yet settled. DT is no exception and in some cases different 
names are used for the same things. For example, some people use “X-ray,” 
others “marginal,” and yet others “discrete Radon-transform” to refer to the 
weighted sums mentioned in the definition above; here we selected to use the 
term “projection.” 

There are many different kinds of objects studied in DT, e.g., lattice sets, 
binary matrices, digital or label images, measurable sets, dominoes, etc. In 
general, they can be represented as functions with a domain X and with a 
given discrete range (of real numbers). Let us denote the class of functions to 
be studied by €. Let S be a collection of subsets of the domain (X) of the 
functions in € (e.g., lines, strips, hyperplanes). The projection of a function 
in € onto a subset S in S is the weighted sum (alternatively, integral) 


PAS) = Dees: (= [ve t) a) 
res S 
where w : X x S — R is a given weight function. (The weight function is 
often chosen, in particular in the examples below, to have the constant value 
1. However, this is not always the case; see Chapters 13 and 16 of this book for 
examples.) With this notation, the uniqueness, existence, and reconstruction 
problems can be stated as follows. 
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UNIQUENESS(E, S). 
Given: A function f € €. 
Question: Does there exist a function f' € € different from f such that 
f and f' have the same projections onto S “for all" S in S? 


EXISTENCE(E, S). 
Given: A real-valued function g defined on S. 


Question: Does there exist a function f € E such that [Pf] (S) = g(S) 
“for all” S in S? 


RECONSTRUCTION(£, S). 


Given: A real-valued function g defined on S. 
Task: Construct a function f € € such that [Pf](S) = g(S) “for 
all” S in S. 


The reason for putting “for all" into quotes, is that sometimes things are 
known to us only partially; we now give two illustrations of the general ter- 
minology, the second of which makes essential use of partial information. 

In the first example, X = Z?, where Z denotes the set of integers. E is 
the set of (0, 1]-valued functions on X, such that the number elements of X 
for which the value is 1 is finite. (These are the characteristic functions of 
finite lattice sets.) The elements of S are the horizontal and vertical lattice 
lines; i.e., sets of the form {(i, j) | j = jo) or {(i, 3) | i = io}. The weight w is 
always 1, and so the projections are just sums of the function values on a 
horizontal or a vertical lattice line. For this example, “for all" really means 
for all. A switching component of an f € € is a set of four points in X of 
the form t= (41,91), v2 = (i2, j1), v3 = (41, j2) and t4 = (i2, j2), such that 
f(x1) Z f(x2), but f(x1) = f(x3) and f(x2) = f(x4). It is a well-known 
result of DT (published in 1957 by H.J. Ryser) that under the definitions of 
this paragraph, a function f is not unique (according to the definition given 
above) if, and only if, f has a switching component (see Theorem 1.4 of [123]). 

In the second example, X — R?, the Euclidean plane, and £ is the set 
of (0, 1]-valued functions f on X, such that the set of elements of X for 
which the value is 1 (the support of f) is of a finite Lebesgue measure. More 
precisely, elements of £ are equivalence classes of such functions: two functions 
are considered equivalent if the symmetric difference between their supports 
is of measure zero. The elements of S are the horizontal and vertical lines (i.e., 
the lines parallel to the axes of R2); there are uncountably many such lines. 
Selecting the weight w to again be always 1, the natural interpretation of the 
integral in (1.1) is that it is a line integral of f along S, providing us with two 
functions of one variable each (one for the horizontal and one for the vertical 
lines). However, either of these two functions may be undefined on a set of 
measure zero (hence the quotes on “for all"). Having the same projections, in 
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this context, is to be interpreted as the same everywhere except on a set of 
measure zero. In a 1988 paper, A. Kuba and A. Volčič introduced the notion 
of a switching component in this context as well (essentially, it consists of four 
vertexes of a rectangle defined by two horizontal and two vertical lines in R? 
and a set of positive measure that is shifted to each of the four points, such 
that points that belong to the sets that are shifted to diagonally opposing 
vertexes of the rectangle are either all in the support of f or none are in the 
support of f, with the opposite being the case for the points of the sets that 
are shifted to the other two vertexes; see, e.g., Chapter 5 of [106]) and proved 
that a function f is not unique if, and only if, f has a switching component. It 
is interesting to observe that the theorem of Ryser mentioned in the previous 
paragraph is, in fact, derivable from this result (by defining for a function 
over a lattice set another one over the Euclidean plane by making it piecewise 
constant in the interiors of the Voronoi neighborhoods of the lattice points). 

This discussion is a minor illustration (using the uniqueness problem) of 
our viewpoint that the tomography of measurable planar sets and the tomog- 
raphy of lattice sets share some basic ideas and methods of proof. Another 
example (which we do not discuss in detail) is provided by the relationship 
between the existence problems for measurable planar sets and lattice sets, 
as characterized by the theorems of Lorentz and Gale-Ryser (both mentioned 
later on in this book), respectively. It is therefore our terminological pref- 
erence to consider both the tomography of measurable planar sets and the 
tomography of lattice sets to be areas of discrete tomography as defined in 
the first paragraph of this section. 


1.3 Conclusions 


The material presented in this chapter clearly demonstrates that DT is an 
extremely active field of research. In recent years it has been producing new 
and powerful results not only in its theoretical foundations and reconstruc- 
tion algorithms, but also in new and previously proposed applications. An 
argument has been put forth that the field of DT should be considered to 
include not only problems capable of being treated by discrete methods, but 
also those that require continuous treatment, as long as their subject matter 
is the recovery of functions with a discrete range. 
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An Introduction to Discrete Point X-Rays 


P. Dulio, R.J. Gardner, and C. Peri 


Summary. A discrete point X-ray of a finite subset F of R” at a point p gives the 
number of points in F lying on each line passing through p. We survey the known 
results on discrete point X-rays, which mostly concern uniqueness issues for subsets 
of the integer lattice. 


2.1 Introduction 


The (continuous) parallel X-ray of a convex body K in n-dimensional Eu- 
clidean space R” in a direction u gives the lengths of all the intersections 
of K with lines parallel to u, and the (continuous) point X-ray of K at a 
point p € R” gives the lengths of all the intersections of K with lines passing 
through p. (See Section 2.2 for all terminology.) In 1963, P.C. Hammer asked: 
How many parallel (or point) X-rays are needed to determine any convex body 
among all convex bodies? Answers to these questions are now known and are 
surveyed in [8, Chapters 1 and 5]. The topic of determining convex bodies 
and more general sets by their X-rays forms part of a larger area of inverse 
problems called geometric tomography, which concerns the retrieval of infor- 
mation about a geometric object (for example, a convex body, star-shaped 
body, or compact set) via measurements of its sections by lines or planes or 
its orthogonal projections on lines or planes. 

Around 1994, Larry Shepp introduced the term discrete tomography. Here 
the focus is on determining finite subsets of the n-dimensional integer lattice 
Z" by means of their discrete parallel X-rays. A discrete parallel X-ray of a 
finite subset F of Z” in the direction of a vector v € Z” gives the number of 
points in F lying on each line parallel to v. 

By now there are many results available on continuous parallel or point 
X-rays of sets and on discrete parallel X-rays of finite subsets of the integer 
lattice. Here we consider the obvious remaining category of X-rays, namely, 
discrete point X-rays. The definition is the natural one: A discrete point X-ray 
of a finite subset F of R” at a point p € IR" gives the number of points in F 
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lying on each line passing through p. Note that the above definition of discrete 
parallel X-ray also extends readily to finite subsets of R”, but as in that case, 
the main interest here is with discrete point X-rays of finite subsets of Z” at 
points in Z”. 

In order to describe our results, it is useful to briefly recall the correspond- 
ing results for discrete parallel X-rays. First, given any finite set U of lattice 
directions in Z?, there are different finite subsets of Z? with equal discrete 
parallel X-rays in the directions in U (see [8, Lemma 2.3.2] or [10, Theo- 
rem 4.3.1]). In view of this, Gardner and Gritzmann [9] focused on convex 
lattice sets, employing the notion of a U-polygon in R? for a given set U of 
directions. Roughly speaking, a U-polygon is a nondegenerate convex polygon 
whose vertices line up in pairs when viewed from a direction in U; see Sec- 
tion 2.2 for the formal definition. When a lattice U-polygon exists, it is easy to 
construct two different convex lattice sets with equal discrete parallel X-rays 
in the directions in U. In [9] it was proved that in fact the nonexistence of a 
lattice U-polygon is necessary and sufficient for the discrete parallel X-rays 
in the directions in U to determine convex lattice sets (provided U has at 
least two nonparallel directions). It is easy to see that when |U| — 3, lattice 
U-polygons always exist. With tools from p-adic number theory, it was shown 
in [9] that they do not exist for certain sets of four lattice directions and any 
set of at least seven lattice directions, but can exist for certain sets of six lat- 
tice directions. Corresponding uniqueness results for discrete parallel X-rays 
follow immediately. 

For discrete point X-rays, there is also a general lack of uniqueness: Given 
any finite set P of points in Z?, there are different finite subsets of Z? with 
equal discrete point X-rays at the points in P. The proof, given in [7, The- 
orem 3.1], is much more involved than for parallel X-rays, requiring an un- 
expected use of the existence of arbitrarily long arithmetic progressions of 
relatively prime numbers. With this nonuniqueness result in hand, we focus 
on convex lattice sets in Z?. In Section 2.3, we provide a rather complete anal- 
ysis when discrete point X-rays are taken at two points. We also note that it 
is hopeless to obtain uniqueness results unless the class of convex lattice sets 
is restricted to those not meeting any line through two of the points at which 
the X-rays are taken, a condition that we shall assume for the remainder of 
this introduction. 

Lemma 2 shows that, as with parallel discrete X-rays, uniqueness results 
for point discrete X-rays hinge on the nonexistence of special lattice polygons 
we call lattice P-polygons, for finite subsets P of Z. (However, the connection 
is less clear than in the parallel case.) An example of a lattice P-polygon is 
shown in Figure 2.1, where P = {(0,0), (210,0), (0, 210)} and the P-polygon 
is the hexagon of the other labelled lattice points. In [7, Theorem 5.1], the 
existence of lattice P-polygons for sets of three collinear lattice points is es- 
tablished. It follows that for uniqueness when the points in P are collinear, P 
must contain at least four points. In Section 2.4, the above results on discrete 
parallel X-rays are combined with the use of a new measure and projective 
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transformations to prove that when the points in P are collinear, uniqueness 
is obtained for certain sets of four points and any set of at least seven points, 
while six points are generally not enough. 

Some results on noncollinear sets P are summarized in Section 2.5. For 
example, there are sets P with as many as six noncollinear points in Z? such 
that the corresponding discrete point X-rays do not determine convex lattice 
sets. 

The development of the paper through Section 2.5 follows the extended 
abstract [6] of the full article [7] in which the authors introduce point X- 
rays. To this is added here a final Section 2.6, containing brief remarks about 
instability results due to Katja Lord (to whom we are grateful for permission 
to include this summary), the possibility of measuring discrete point X-rays 
in practice, and our ideas for future work. 


2.2 Definitions and Preliminaries 


As usual, $”~! denotes the unit sphere and o the origin in Euclidean n-space 
R”. If u € R”, we denote by u^ the (n — 1)-dimensional subspace orthogonal 
to u. The standard orthonormal basis for R” will be {e1,..., en}. The line 
segment with endpoints x and y is denoted by [x, y], and we write Lr, y] for 
the line through x and y. 

If A is a set, we denote by |A|, 0A, and conv A the cardinality, boundary, 
and convez hull of A, respectively. The notation for the usual orthogonal 
projection of A on a subspace S is A|S. The symmetric difference of two sets 
A and A» is A; A A43 = (Ai \ Ap) U (A2 \ 41). 

We denote n-dimensional projective space by P”, and regard it as R”UH æ, 
where Ha is the hyperplane at infinity. Points in Hə can be associated with 
a pair (u, —u} of directions in S"-1. If ¢ is a projective transformation from 
P” onto P” mapping a point p in Ha to a finite point op in P™ (i.e., a point 
in R™), then lines in R” parallel to a direction u associated with p map to 
lines passing through óp. If E C P” is such that dE C R™ (i.e., GE does 
not meet the hyperplane at infinity in IP"), then ¢ is called permissible for E. 
Note that ¢ preserves the convexity of sets in IR" for which it is permissible. 
See [8, pp. 2, 7] for more details. 

The cross ratio (pi, pa, pa, pa) of four points p;, i = 1,...,4, in a line L is 
given by 
(zs — z1)(z4 — 22) 


Gr ar (20) 


(p1, P2, P3, P4) = 
where z; is the coordinate of p;, i = 1,...,4, in some fixed Cartesian coordi- 
nate system in L. See, for example, [3, Section 6.2]. 

A convex polytope is the convex hull of a finite subset of R”. We sometimes 
refer to a finite subset of the n-dimensional integer lattice Z” as a lattice set. 
A convex lattice set is a finite subset F of Z” such that F = (conv F) n Z^. 
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A lattice polygon is a convex polygon with its vertices in Z?. A polygon is 
called rational if its vertices have rational coordinates. A lattice line is a line 
containing at least two points in Z?. 

Call a vector u € Z" primitive if the line segment [o, u] contains no lattice 
points other than o and u. 

Let F be a finite subset of R” and let u € R" \ (o). The discrete parallel 
X-ray of F parallel to u is the function X,,F defined by 


X,F(v) = |F à (L[o,u] + v)| , (2.2) 


for each v € u^. The function X,,F is in effect the projection, counted with 
multiplicity, of F on u^. For an introduction to the many known results on 
discrete parallel X-rays and their applications, see [4], [9], [10], and [11]. 

Let F be a finite subset of R” and let p € R”. The discrete point X-ray of 
F at p is the function X, F defined by 


XpF(u) = |F N (Lo, u] + p)| , (2.3) 


for each u € R” X (oj. 

Let U be a finite set of vectors in R?. We call a nondegenerate convex 
polygon Q a U-polygon if it has the following property: If v is a vertex of Q, 
and u € U, then the line v + L[o, u] meets a different vertex v' of Q. 

Let P be a finite set of points in R?. A nondegenerate convex polygon Q 
is a P-polygon if it satisfies the following property: If v is a vertex of Q, and 
p € P, then the line L[p, v] meets a different vertex v' of Q. 

Note that in view of these definitions, a lattice P-polygon is a convex 
subset of R?, while a convex lattice polygon is a finite subset of Z?. 

There is a convenient common generalization of the previous two defini- 
tions. Consider P? = R? U H, and let P be a finite set of points in P?. A 
nondegenerate convex polygon Q in R? is a P-polygon if it satisfies the fol- 
lowing property: If v is a vertex of Q, and p € P, then the line L[p, v] in P? 
meets a different vertex v’ of Q. Note that if P C Hæ, then the P-polygon Q 
is also a U-polygon for the set U of unit vectors associated with points in P. 

Let F be a class of finite sets in IR" and P a finite set of points in R”. We 
say that F € F is determined by the discrete point X-rays at the points in P 
if whenever F’ € F and X,F = X,F' for all p € P, we have F = F”. 


2.3 Discrete Point X-Rays at Two Points 


Theorem 1. Let pı and pz be distinct points in Z?. Then there are different 
convex lattice sets that meet L[p1, po] and have equal discrete point X-rays at 
pi and pə. 


Proof. Without loss of generality, let pı = (0,0) and po = (k,0) for some 
k > 0. Suppose that m € N. Then the sets Kı = {(k +7,0) |i =1,...,m} 
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and Kə = {(k + i,0) | à = 2,...,m+ 1} have equal discrete point X-rays 
at pı and p». By adjoining the point (k + m,1) to both sets, we can obtain 
two-dimensional examples with the same property. 














Note that the sets Kı and Kə in the previous theorem also have the same 
discrete point X-rays at any lattice point on the z-axis. 


Theorem 2. Let Kj, i = 1,2 be convex lattice sets in Z? with equal discrete 
point X-rays at distinct points p1, p2 € Z?. Suppose that 


(a) Lipi, po] N K: = 0, i = 1,2, and 
(b) conv Kı and conv Ko either both meet |pı, pə] or both meet L{pi, pə] N 
pi, p2]. 


Then Kı = Ko. 


Proof. By (a) and the fact that K;, i = 1,2 are convex lattice sets, we have 
pi € conv Ky U conv Ks, i = 1,2. Suppose that conv Kı and conv Kə both 
meet L[pi, po] \ [p1, p2]. If pı and po lie between conv K, and conv Ko, these 
sets cannot have equal supporting lines from pı and ps, contradicting the 
equality of the discrete point X-rays of Kı and Kə at p; and p2. Then we 
may assume that pi, po, and L[pi, po] N conv K;, i = 1,2 are in that order on 
L[pi, pa]. Suppose that Kı # Kə. Without loss of generality, we may assume 
that L[pi, p2] is the z-axis. Then by (a), we can assume that (Ky A Ko) {y > 
0} Æ 0. Let Lı be the line through pz and containing a point of K1/ Ko, with 
minimal positive angle with the z-axis. Since Kı and Kə have equal discrete 
point X-rays at po, there are points vı € Ky \ K2 and v2 € Kə \ Kı on Ih, 
and we can assume that po, v1, and vz are in that order on Lı. Since Ky and 
K have equal discrete point X-rays at pı, the line Lə through pı and vı must 
meet K5VK in a point vs. If pi, v1, and v3 are in that order on Lə, then the 
line through po and v3 has a smaller positive angle with the x-axis than L4. 
Therefore, v3 € [p1, vj]. Assumptions (a) and (b) imply that there is a point 
c € Kon (y < 0}, but then vı ¢ K» lies in the interior of the triangle with 
vertices vo, v3, and c, all of which lie in K5. This contradicts the fact that Kə 
is a convex lattice set, and proves that Kı = Ko. 

The case when conv Ay and conv Kə both meet [pi, po] is proved in similar 
fashion. 














The next result shows that the assumption (b) in Theorem 2 is necessary. 


Theorem 3. Let pı and pz be distinct points in Z?. Then there are different 
convex lattice sets Kı and Ky such that L|pi, pa] O Ki = 0 and L[py, pa] N 
conv K; Æ (), i = 1,2, and with equal discrete point X-rays at py and pa. 


Proof. Let pı = (0,0), and let po = ku, where u € Z? is primitive and k € N. 
Then there is a v € Z? such that (u, v} is a basis in R7. The unimodular affine 
transformation mapping {u,v} to {e1,e2} is a bijection of Z? onto itself pre- 
serving convexity and incidence. Therefore, we may, without loss of generality, 
take pı = (0,0) and pa = (k,0) for some k > 0. 
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Suppose that k = 1. Then the sets Kı = {(2,3),(—1,—2)} and K2 = 
{(3,6), (—2, —3)} fulfill the requirements of the theorem. 

Now suppose that k > 1. Let a = (k, k), b = (k,k+ 1), c = (—-k(k—1),1— 
k?), and d = (—k(k? — 1), —k(k? — 1)). Let Kı = {a,c} and Ko = {b, d}. It is 
easy to check that L[pi, po] N conv K; 4 0, i = 1,2 and that the sets Kı and 
Kə have equal discrete point X-rays at pı and pa. It remains to show that Kı 
and Kə are convex lattice sets. To this end, note that the line L[a, c] has slope 
(k? + k — 1)/k?. Moreover, k? -- k — 1 and k? are relatively prime; otherwise, if 
p > lis prime, p|(k? - k — 1), and p|K?, then p|(k — 1), so p does not divide k, 
contradicting p|k?. It follows that Kı = (conv K1) N Z?, as required. The line 
L{b, d] has slope (k? + 1)/k?, and since k? and k? + 1 are consecutive integers, 
they are relatively prime. Consequently, Kə = (conv K2) Z2, and the proof 
is complete. 

















The next two lemmas are rather general and will be useful also in subse- 
quent sections of the paper. 


Lemma 1. If Q is a P-polygon such that |P| > 2 and PAQ = 0, then Q does 
not meet any line through two points in P. 


Proof. Let pı and p» be different points in P, and without loss of generality, 
suppose that they lie on the z-axis and that Q is a P-polygon whose interior 
meets the upper open half-plane. Suppose that QN [p1, p2] Z 0. Let Lı be the 
lattice line through pı with minimal positive angle with the x-axis such that 
Lı contains vertices v; and v2 of Q. Without loss of generality suppose that 
Pı, v2, and v lie on L4 in that order. Since Q meets [p1, p2], by convexity the 
line Lə through pa and v2 contains a vertex v3 of Q with po, vs, and v2 in that 
order on Lz. But then the line L3 through pı and va has a smaller positive 
angle with the z-axis than L4, a contradiction. A similar argument applies to 
the case when Q meets the z-axis outside the segment [p1, po]. 














Lemma 2. Let P be a set of points in Z?. If there is a lattice P-polygon Q, 
then there are different convex lattice sets Kı and Kə with equal discrete point 
X-rays at the points in P. Moreover, if PAO Q = 0, then in addition conv Kı 
and conv Ka do not meet any line through two points of P. 


Proof. Let Q be a lattice P-polygon. Partition the vertices of Q into two 
disjoint sets Vi and V2, where the members of each set are alternate vertices 
in a clockwise ordering around 0Q. Let 


C = (ZnQ)J NW UV), (2.4) 


and let K; = CUV;, i = 1,2. Then Kı and Kə are different convex lattice 
sets with equal discrete point X-rays at the points in P. 

If PANQ = 0, then by Lemma 1, Q does not meet any line through two 
points of P and the second statement follows immediately. 
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Theorem 4. Let p; and pa be distinct points in Z? and let P = {p1, p2}. Then 
there is a lattice P-polygon Q with PAQ = 0, and hence two different convex 
lattice sets, with convex hulls disjoint from L[p1,p2| and with equal discrete 
point X-rays at the points in P. 


Proof. Without loss of generality, let p; = (0,0) and po = (k,0) for some 
k > 0. Then one can check that (2k, 2k), (3k, 3k), (3k, 4k), and (9k,12k) are 
the vertices of a lattice P-quadrilateral. The conclusion follows from Lemma 2. 














2.4 Discrete Point X-Rays at Collinear Points 


As we have seen, a convex lattice set is determined by its discrete point X- 
rays at two different points only in the situation of Theorem 2. Thus to have 
more general uniqueness results, we need more than two points. 

In fact, for collinear sets of points, at least four points are necessary. This is 
a consequence of the following theorem, proved in [7, Theorem 5.1] by means 
of some number-theoretical computations involving the Chinese remainder 
theorem, and its corollary. 


Theorem 5. If P is a set of three collinear points in Z?, there exists a lattice 
P-hexagon. 


Corollary 1. If P is a set of three collinear points in Z?, then convex lattice 
sets not meeting the line containing P are not determined by discrete point 
X-rays at the points in P. 











Proof. This is an immediate consequence of Theorem 5 and Lemma 2. 





To make progress, we require the following technical lemmas. 


Lemma 3. Let p € Z? and let F, and F> be finite subsets of Z? such that 
p [2 Fi U Fy and XyIhn = XpFo. Then |F] = | P|. 


Proof. Since p ¢ F1U Fy, we have for i = 1, 2, 


|E| = SO [Fin Go, u] t p)| = $ X,FiQ)- (2.5) 


ucs! uESt 














Let L be a lattice line in R?, and suppose that L is taken as the z-axis in 
a Cartesian coordinate system. For each finite set F in Z?, define 


V(F)= M 2 (2.6) 


eim ly| 


Then v is a measure in Z?, and we call L the base line of v. 
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Lemma 4. Let v be a measure defined by (2) with respect to the base line 
L. Suppose that F, and F> are finite subsets of Z? contained in one of the 
open half-planes bounded by L and with equal discrete point X-rays at p € 
LnZ?. Then the centroids of F, and F> with respect to v lie on the same line 
through p. 


Proof. Without loss of generality we may take L to be the x-axis, p = (0,0), 
and F, and F; finite subsets of Z? contained in the upper open half-plane. Let 
c; = (xi, yi) be the centroid of F;, for i = 1,2, with respect to the measure v. 








Then i 
x 
v(F;) (ewer, V 
and T 
=, 2.8 
vi = S (2.8) 
for i = 1,2. Therefore, 
Ti $5 (2/y) | M (XpFi(u)) cot 0(u) ' 
(z,y)€ Fi ue St 


for i = 1,2, where 6(u) denotes the angle between the x-axis and a line parallel 
to u. Since X Fi = X,F» and p d F U F5, we have |Fi| = |F2| by Lemma 3, 
and hence y1/zi = y2/x2, as required. 














Theorem 6. Let P be a set of at least three points in Z? lying in a line L. If 
there are different convex lattice sets not meeting L with equal discrete point 
X-rays at the points in P, then there is a rational P-polygon disjoint from L. 


Proof. Let Kı and Ko be different convex lattice sets not meeting L and with 
equal discrete point X-rays at the points in P. If LAconv Kı Æ 9, then clearly 
LN conv Kz # (). Then either for some 1 <i Z j < 3, conv Kı and conv Kə 
both meet [p;, pj], or for some 1 <i Z j € 3, conv Ky and conv Kə both meet 
L|pi. pj] \ [pi, pj], contradicting Theorem 2. 

Consequently, L N conv Kı = () and therefore L N conv Kə = Ø. Then we 
can follow exactly the proof of [9, Theorem 5.5] for discrete parallel X-rays, on 
replacing lattice lines parallel to directions in a set with lattice lines through 
points in P, replacing ordinary centroids with centroids with respect to the 
measure v defined by (2.6) with base line L, and using Lemma 4 instead of [9, 
Lemma 5.4]. Note that this argument uses only cardinality and collinearity 
properties and the fact that the centroid of a finite set of lattice points is a 
point with rational coordinates, a fact that still holds when centroids are taken 
with respect to v. Also, note that the observation that |U| > 4 in the second 
paragraph of the proof of [9, Theorem 5.5] is not needed. The conclusion is 
that there is a rational P-polygon disjoint from L. 
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Theorem 7. (i) Let U be a set of mutually nonparallel vectors in Z? such 
that there exists a lattice U-polygon and let L be a lattice line. Then for some 
set P of |U| points in L, there exists a lattice P-polygon disjoint from L. 

(ii) Let P be a set of at least two points in Z? in line L such that there exists 
a rational P-polygon disjoint from L. Let o be a projective transformation of 
IP? taking L to the line at infinity, and let U = P. Then there exists a lattice 
U -polygon. 


Proof. (i) Let Q be a lattice U-polygon and suppose that L is a lattice line. 
Let $ be a nonsingular projective transformation of P? such that $H,4 = L, 
where H is the line at infinity in P?, so that LN@Q = 0. If p € P? has rational 
coordinates (rational slope if p € Ha), then op also has rational coordinates. 
By translating Q, if necessary, we may assume that Q N $^! Hy = @. Then 
($Q) N Hæ = 0, so ¢ is permissible for Q and hence $Q is a rational QU- 
polygon, where QU is a set of |U| points in L with rational coordinates. Choose 
an m € N so that the |U| points in mQU and the vertices of m¢Q belong to 
Z?. Then m¢Q is a lattice m@U-polygon, and m@U is a subset of the line 
mL. Let w be a translation taking mL onto L and let P = v(mdQU). Then 
w(m@Q) is the required lattice P-polygon. 

(ii) Let Q be a rational P-polygon disjoint from L. Since the hypotheses 
ensure that Q is permissible for ¢ and L is a lattice line, $Q is a rational U- 
polygon. Then there is an m € N such that m@Q is a lattice U-polygon. 














Corollary 2. Let P be a set of points in Z? in a line L. Then convex lattice 
sets in Z? not meeting L are determined by discrete point X-rays at the points 
in P if either 


(a) |P| 2 7, or 
(b) |P| =4 and there is no ordering of points in P such that their cross ratio 
is 2, 3, or 4. 


On the other hand, it is possible that |P| — 6 and there exist different 
convex lattice sets with convex hulls disjoint from L and equal discrete point 
X-rays at points in P. 


Proof. Suppose that P is a set of points in Z? in a line L, such that convex 
lattice sets in Z? not meeting L are not determined by discrete point X-rays 
at the points in P. Then, by Theorem 6, there is a rational P-polygon disjoint 
from L. Theorem 7(ii) implies that there is a set U of |P| mutually nonparallel 
vectors such that there exists a lattice U-polygon. By [9, Theorem 4.5], we 
have |U| € 6, so |P| € 6 and (a) is proved. Moreover, if |P| = |U| = 4, [9, 
Theorem 4.5] implies that there is an ordering of the vectors in U such that 
their cross ratio is 2, 3, or 4. Since U is obtained from P by a projective 
transformation, and such transformations preserve the cross ratio, the same 
is true for P. Therefore, (b) is established. 

By [9, Example 4.3], there is a set U of six mutually nonparallel vectors 
such that there exists a lattice U-polygon. It follows from Theorem 7(i) that 
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there is a set P of six points in L N Z? such that there is a lattice P-polygon 
disjoint from L. The proof is completed by an application of Lemma 2. 














In particular it follows from the previous result that convex lattice sets not 
meeting the x-axis are determined by their discrete point X-rays at points in 
the set {(0, 0), (1,0), (2, 0), (5, 0)}. 


2.5 Discrete Point X-Rays at Noncollinear Points 


Voléié (see [12] or [8, Chapter 5]) proved that planar convex bodies are de- 
termined by their continuous point X-rays at any set of four points, no three 
of which are collinear. We see in this section that the situation is somewhat 
different for discrete point X-rays. 
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Fig. 2.1. A lattice P-hexagon for three noncollinear points. 


In [7, Theorem 7.1], Pappus’ theorem is used to prove that there are sets 
P = (pmi, p2, p3} of three noncollinear points in Z?, such that there exists a 
lattice P-hexagon Q with P N Q =. An example is depicted in Figure 2.1, 
where pı = (0,0), po = (210,0), and ps = (0,210). The “center” c of this 
hexagon, that is, the common intersection of the lines through pairs of opposite 
vertices, has coordinates (105/2, 105/2), so on multiplying each coordinate by 
2, we obtain an example where c is also a lattice point. The hexagon is then 
a lattice P'-polygon for a set P’ = (pi, p2, pa, c] of four noncollinear points. 
By Lemma 2, there are different convex lattice sets with equal discrete point 
X-rays from the points in P’. Such examples show that the results of Volčič 
(see [12] or [8, Theorems 5.3.6 and 5.3.7]) do not hold in the discrete case. 

A more elaborate construction in [7, Theorem 7.5] provides sets P of four 
noncollinear points in Z? such that there exists a lattice P-octagon Q with 
PANQ = 0. This and Lemma 2 show that another result of Volčič (see [12] or 
[8, Theorem 5.3.8]) also does not hold in the discrete case. 
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Theorem 8. There is a set P of four points in Z?, no three of which are 
collinear, such that convex lattice sets not meeting any line joining two points 
in P are not determined by discrete point X-rays at the points in P. 


Finally, it is demonstrated in [7, Theorem 7.6] that there exists a set P 
of six points, no four of which are collinear, for which there is a lattice P- 
dodecagon Q with PN Q = Ø. This can be combined with Lemma 2 to yield 
the following result. 


Theorem 9. There is a set P of six points in Z?, no four of which are 
collinear, such that convex lattice sets not meeting any line joining two points 
in P are not determined by discrete point X-rays at the points in P. 


2.6 Concluding Remarks 


At present, so far as we are aware, the only results on discrete point X-rays in 
the integer lattice beyond those in [7] are contained in as yet unpublished work 
by Katja Lord, who has obtained some instability results for discrete point 
X-rays analogous to those found by Alpers and Gritzmann [2] for discrete 
parallel X-rays. Suppose that m > 3, that p1,..., Pm are distinct lattice points 
in Z?, and that n € N. Lord proves that there are disjoint finite subsets Fj 
and F3 of Z?, of cardinality at least n, such that (i) both F, and F> are 
uniquely determined by their discrete point X-rays at the points p1,..., Pm, 
(ii) $5254 Xp, F1 — Xp, F5| = 2(m— 1), and (iii) |Fy Q9(F3)| € 3 (or € 4) for any 
affine transformation $ : IR? — R? (or projective transformation $ : P? — P?, 
respectively). Thus there is an even worse instability for discrete point X-rays 
than for discrete parallel X-rays, for which the results are the same except 
that in (iii) the bounds depend on m. She also extends these results to higher 
dimensions. 

According to a couple of experts in electron microscopy (private communi- 
cations), there might be a chance to realize discrete point X-rays 
experimentally for small samples. Apparently the STEM (scanning transmis- 
sion microscopy) (also called Z-contrast) technique lends some hope for this, 
though there would be some extra technical challenges. 

For the moment, however, discrete point X-rays lack the application in 
the material sciences enjoyed by discrete parallel X-rays. Nevertheless, the 
theory of discrete point X-rays seems worthy of study as interesting in its 
own right. In particular, lattice P-polygons, and indeed P-polygons generally, 
are fundamental incidence objects whose structure remains quite mysterious 
when the points in P are not collinear. 

In future work, the authors hope to tackle the obvious open questions con- 
cerning uniqueness: Are convex lattice sets in Z? determined by their discrete 
point X-rays at some set of three noncollinear points, or at any set of seven or 
more noncollinear lattice points in Z?? Complexity issues and reconstruction 
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algorithms, both for general lattice sets and special ones such as polyominoes, 
need to be addressed. Algebraic methods might also be applicable. Generally, 
it is likely that while some results will transfer from parallel to point X-rays 
by appropriate use of projective transformations, especially when the points 
at which the X-rays are taken are collinear, the noncollinear case may well 
require new techniques. 

We remark that the notion of a point X-ray in a graph has been introduced 
and studied from a similar point of view (questions of uniqueness and so 
on). The concept is, however, quite different from the discrete point X-rays 
considered in this article. For more details, see the work of Dulio [5]. 
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Reconstruction of Q-Convex Lattice Sets 


S. Brunetti and A. Daurat 


Summary. We study the reconstruction of special lattice sets from X-rays when 
some convexity constraints are imposed on the sets. Two aspects are relevant for a 
satisfactory reconstruction: the unique determination of the set by its X-rays and the 
existence of a polynomial-time algorithm reconstructing the set from its X-rays. For 
this purpose we present the notion of Q-convex lattice sets for which there are unique 
determination by X-rays in suitable directions, and a polynomial-time reconstruction 
algorithm. After discussing these results, we show that many reconstructions of sets 
with convexity and connectivity constraints can be seen as particular cases of the 
algorithm reconstructing Q-convex lattice sets. 


3.1 Introduction 


Let D be a set of directions, and B be a class of lattice sets (finite subsets of Z?). 
The main problem in discrete tomography (DT) is to reconstruct a member 
of B by its X-rays taken in the directions of D. For applications, the retrieval 
of the original lattice set is of interest. T'his is ensured by uniqueness of the 
reconstruction. Therefore, the first important question is to know whether 
any member of B is uniquely determined by its X-rays parallel to directions 
in D among the members of B. In case of positive answer, we say that B is 
determined by D. In general, if no assumption is made on the set, it cannot be 
uniquely reconstructed [5]. On the other hand, Gardner and Gritzmann have 
proved that a convex lattice set is completely determined by its X-rays parallel 
to some sets consisting of four directions [16]. Unfortunately, the proof of this 
result is not constructive: It does not give an algorithm that reconstructs 
the set from the given X-rays. The second question is to know what is the 
complexity of reconstructing lattice sets of B, that is, whether they can be 
computed in polynomial time in the size of the data. There are examples of 
classes of lattice sets for which there exist efficient reconstruction algorithms. 
In particular, in [4] it is proved that the hv-convex polyominoes (defined 
ahead) can be reconstructed from X-rays parallel to horizontal and vertical 
directions in polynomial time. 
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This result and the uniqueness theorem of Gardner-Gritzmann described 
above are a priori unrelated: The directions of the X-rays and the considered 
classes of lattice sets are different. In this chapter we introduce a third class of 
lattice sets, so-called Q-convez lattice sets, which strongly links the two results. 
Q-convexity, which will be defined precisely in Section 3.3, is a property of 
lattice sets that depends on a set of directions. Generally, the directions of Q- 
convexity are the same as the ones of the X-rays. In Section 3.4, we show that 
Q-convex lattice sets are completely determined by X-rays in the same set of 
directions that distinguish convex lattice sets. An algorithm that reconstructs 
Q-convex lattice sets is presented in Section 3.5, extending the algorithm for 
hv-convex polyominoes. 

If we consider Q-convex lattice sets w.r.t. the directions of uniqueness, 
the reconstruction algorithm for Q-convex lattice sets can be exploited for 
reconstructing convex lattice sets. This represents one case where the fact 
that lattice sets of the class are uniquely determined by a set of directions 
can be exploited to compute members of the class in polynomial time. There 
are a few other classes that are both uniquely determined by X-rays and 
reconstructible in polynomial time: We cite the class of additive sets that are 
reconstructed in polynomial time by use of linear programming [15], and the 
class of directed column-convex polyominoes associated with the horizontal 
and vertical directions [13]. 

We conclude the chapter by showing in Section 3.6 the consequences on 
other computational problems in DT, and in particular how already known 
reconstruction results can be proved again using Q-convexity. 

Where some proofs are only sketched in this chapter, the reader can refer 
to the cited articles for the details. Especially, the result of Section 3.4 has 
already been published in [11], but the proof is only sketched in this chapter. 
The algorithm described in Section 3.5 is the same as the one published in [6] 
except for the last step (reduction to 2-SAT), which is faster and simpler in 
this chapter. 


3.2 Definitions and Preliminaries 


We first give some notations and classical definitions we use in the chapter, 
and then we recall some known results that are relevant for our discussion. 


3.2.1 Definitions 


A lattice direction is represented by a vector p = (a,b) 4 (0,0) with a,b co- 
prime integers. Any straight line parallel to p has equation bx — ay = k where 
k is a constant. By convenience, we define the function p by p(x, y) = bx — ay. 
So if Mı and Mə are two points on the same straight line parallel to the 
direction p, then p( Mı) = p(M2). The straight lines of direction p are called 
p-lines, and sometimes we denote by p = k the line of equation p(x, y) = k. 
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The horizontal (respectively, vertical, diagonal) direction (1,0) (respec- 
tively, (0,1), (1, —1)) is denoted by h (respectively, v, d), and so, h(a, y) = 4); 
v(z,y) = x and d(z,y) = x +y. 

If p; = (ai, bi), i =1,...,4, are four different lattice directions with slopes 
(possibly infinite) 4; = —b;/a;, then the cross ratio of these four directions, 
denoted by [p1, p2, p3, pa], is the quantity 


(As — Ai) (Aa — Aa) 
(As = A2)(Aa — 1) ` 


The ordered cross ratio of p1,..., p is [Po(1); Po(2); Po (3); Po (4)], Where o is the 
permutation such that Ag(;) < Ac(i41). The ordered cross ratio of four lattice 
directions is always a rational number that is greater than 1. 

Let E be a lattice set. The X-ray of E parallel to a direction p is the 
function X, E : Z —^ No defined by X,E(k) = [M € E | p( M) = k}|, where 
|S| denotes the cardinality of a set S. 

Let B be a class of lattice sets and D be a set of directions. We say that 
D determines the class B of lattice sets if for every sets Ej and E of B, we 
have 


[p1. P2, P3, pa] = (3.1) 


(Vp ED, Xp ky = XpE2) — F = E2. (3.2) 


In this chapter we deal with special lattice sets having convexity and con- 
nectivity constraints. Let us recall some basic definitions. 

A lattice set is convex if it is the intersection between a convex polygon 
and Z?. We denote the class of convex lattice sets by C. A lattice set E is 
line-convez along a direction p if the intersection of any line parallel to p 
and E is empty or it is the set of the points with integer coordinates in a 
straight line segment. So, a convex lattice set is also line-convex along all 
directions. T'he class of line-convex sets along the directions in D is denoted 
by £(D). A lattice set that is line-convex along the horizontal and vertical 
directions is said to be hv-convex. A 4-path (respectively, an 8-path, a 6- 
path) is a finite sequence (Mo, Mi, ..., Mn) of points of Z? such that M;,1 — 
Mi is in the set {(£1,0), (0, +1)} (respectively, {(£1,0), (0, +1), (+1, +1)}, 
{(#1, 0), (0, +1), (1, 1), (—1, —1)}). For k = 4,6,8, a lattice set E is said to be 
k- connected if, for any A, B in E, there is a k-path from A to B. The class of 
the k-connected lattices sets is denoted by Pk. So we have that Ps C Ps C Ps. 
A 4-connected lattice set is also called a polyomino. 
































3.2.2 Previous Results 


Here are two important theorems that are the bases of the results of this 
chapter. 


Theorem 1. /16] A set D of lattice directions determines the class C if, 
and only if, it contains four directions whose ordered cross ratio is not in 
(4/3,3/2,2,3, 4). In particular, any set of seven different lattice directions 
determines the convex lattice sets. 


34 S. Brunetti and A. Daurat 


For example, the set of directions ((1,0), (0, 1), (2, 1), (1, —2)} determines the 
convex lattice sets. 


Theorem 2. [/] Let D = (h,v). The reconstruction problem for P4 £(D) 
(hv-convex polyominoes) can be solved in polynomial time. 


'The following sections generalize these results to a common class of lattice 
sets. A natural candidate is the class of line-convex sets; unfortunately, neither 
of the two theorems holds for this class: 


(a) No finite set of lattice directions determines £(D), for any fixed D. This 
result is a simple consequence of Proposition 3.6 of [5]. 

(b) The reconstruction of an hv-convex set from the horizontal and vertical 
X-rays is NP-hard [23]. 


3.3 Q-Convexity 
If p = (a,b) and q = (c, d) define two lattice directions and M is a point of 


R?, then the lines of directions p and q through M determine the following 
four quadrants (see Fig. 3.1): 


Zo (M) -(M'eZ^|p(M')zxpM)andq(M') xq(M)), X (333) 
ZI (M) = (M'eZ^|p(M') > p(M)andq(M') q(M)), (34) 
Zi(M)-(M'eZ^|p(M')2p(M)andq(M') 24q(M)), — (3.5) 
Z3"(M) -(M'eZ^|p(M') < p(M) and q(M’) > q(M)). X (3.6) 





hd q —4(M) 


Zo (M) Z(M) 





Fig. 3.1. The four quadrants around a point M w.r.t. the vertical and horizontal 
directions (f(x,y) = z, (x,y) = v). 


Definition 1. A lattice set E is Q-convex (quadrant-convez) w.r.t. D = {p,q} 
if ZI (M)n E z 0, for k € {0,1,2,3}, implies that M € E. 
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Definition 2. A lattice set is Q-conver w.r.t. a set D of directions if it is 
Q-conver w.r.t. every pair of directions included in D. 


We denote the class of the Q-convex sets w.r.t. D by Q(D). Figure 3.2 
illustrates some examples of @-convex lattice sets. The following proposition 
shows the links between the different notions of convexity. 


Proposition 1. Let E be a lattice set. 


(a) If E is convex, then E is Q-convex w.r.t. any set of directions. 

(b) If E is Q-convez w.r.t. D, then it is line-convex along D. 

(c) If E is hv-convex and 8-connected, then it is Q-convez w.r.t. the horizontal 
and vertical directions. 


This can be summarized by 
Cc QD) ELOD), |. PsN L({h, v}) € O({h, v}) . (3.7) 


Proof. (a) Suppose that E is convex, and p,q are two lattice directions. Let 
M be such that each Z?4(M) contains a point N; of E. As the point M 
is in the convex hull of the N;, by convexity it is in E. 

Suppose that E is Q-convex w.r.t. D and M is in a segment [A, B] parallel 
to a direction p € D with A, B € E. Let q be another direction of D, 
and suppose for example that q(.A) € q(M) < q(B). Then we have A € 
Zi (M)n ZB" (M) and B € Z'* (M)n ZI" (M), so by Q-convexity M € E. 
Suppose now that E is hv-convex and 8-connected. Let M be such that 
each Z/"(M) contains a point N; of E. For each i there is an 8-path 
joining N; and N;,1 included in E, and so there is a point A; in Z?’(M) 
that satisfies h(A;) = h(M) or 6(A;) = 6(M). Then it is easy to see that 
M lies in a horizontal or vertical segment [A;A,;], and so M is in E by 
hv-convexity. 


(b 


mH 


— 
e 
SEE, 














SR. uos AOV EA m 
pov ee aa i ient 


Fig. 3.2. (a) A set that is line-convex but not Q-convex w.r.t. (h, v). (b) A Q-convex 
lattice set w.r.t. {h, v}. (c) A Q-convex lattice set w.r.t. {h, v, d}. 
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3.4 Uniqueness Result 


The aim of this section is to explain and sketch the proof of the following 
result extending Theorem 1 to Q-convex lattice sets. 


Theorem 3. /11] Let D be a set of lattice directions. Then D determines the 
class Q(D) if, and only if, it contains four different directions whose ordered 
cross ratio is not in {4/3, 3/2, 2,3, 4}. 


Proof. By Proposition l(a), C € Q(D). Hence, if D determines Q(D), then 
it determines C. By Theorem 1, the set D contains four different directions 
whose ordered cross ratio is not in {4/3, 3/2, 2,3,4}. This completes the proof 
of the “only if” part. The proof of the “if” part is sketched out in the rest of 
this section. 














'To describe the fundamental steps of the proof, we need some definitions, 
which are based on the ideas from [16]. 


Definition 3. A D-polygon P is a convez polygon such that any line of di- 
rection in D contains either no or two vertices of P. 


Definition 4. Let Zm denote the set of integers modulo m. A D-sequence is 
a sequence (Ax)kez,, of m points of R? such that m is even and for any p in 
D there is an s € Zm such that 


p(As-1) < p(As_2) SSS B(As— m), 
P(As) «P(Asa) <--> < P(Asc m1). 


(3.8) 


Notice that the vertices of a D-polygon always form a D-sequence, but 
the converse is not true: Fig. 3.3 shows a D-sequence, with D = ((1,0), (0, 1), 
(1,1), (1, 2 1)), which is not a convex polygon, because the point A» is in the 
interior of the convex hull of the points Ao, Ai,..., A15. 

The “if” part of Theorem 3 is a consequence of the following three lemmas. 


Lemma 1. If two distinct Q-convez lattice sets have the same X-rays in D, 
then there exists a D-sequence. 


Proof (sketch). Let us assume that two distinct Q-convex lattice sets E; and 
E» have the same X-rays in D. Let Fy = Ei \ E» and F5 = Ez \ Ey. We say 
that two points A and B of F; are related by ~ (which is denoted by A ~ B) 
if there exist two directions p,q € D and a quadrant ZP*(N) with N € R? 
containing A and B and such that ZP*(N) n F> = Q. Similarly, we define the 
relation ^ on the points of Fy. Then the following properties can be proved 
in this order: 


(a) The relation ~ is an equivalence relation on Fi U F5. 
(b) If A, B € Fy, A’, B' € Fy and p € D are such that p( A) = p(A’), p(B) = 
p(B’), then A ~ B implies A’ ~ B’. 
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Ais A14 
A13 
A 
i Arz 
A 
Az Au 
3s A10 
Ao 
A4 
As Ag 
Ag Az 


Fig. 3.3. A D-sequence that is not a D-polygon. 


(c) The set of the centroids of the equivalence classes of ~ can be ordered in 
a D-sequence. 


This construction is illustrated in Fig. 3.4 (for details, see Section 3.3 of 
[11].) 


For the two next lemmas we need another definition: 














Definition 5. An affinely regular polygon is the image of a regular polygon 
by an affine transformation of the plane. 


Lemma 2. If there exists a D-sequence, then there exists an affinely regular 
D-polygon. 


Proof (sketch). If (Mi)xez,, is sequence of points, we define the midpoint se- 
quence $((My)) as the sequence (My + Mx+1)/2),¢z,, formed by midpoints 
of consecutive points of Mz. 

To prove the lemma we suppose that there exists a D-sequence (Ax) kez, . 
We can suppose that the centroid of (Ax)xez,, is (0,0). Then we consider the 
sequence (Pa) = $?"((A,)) of sequences of points obtained by iterating the 
transformation 9? to (Ag). Classical linear algebra shows that cos ?^(£)(P;) 
converges to an affinely regular polygon P. Moreover, as each P, satisfies 
(3.8), the polygon P also satisfies it and so is a D-sequence. Thus, finally P 
is an affinely regular D-polygon. 














Lemma 3. Let D be a set of lattice directions. If there exists an affinely regu- 
lar D-polygon, then four directions of D have a cross ratio in (4/3,3/2,2,3,4). 


This lemma was already used for the proof of Theorem 1 (see Lemma 4.3.7 
of [17]). Its proof, which is omitted in this chapter, uses the complex roots of 
the unity and p-adic analysis. It achieves the proof of Theorem 3. 
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Fig. 3.4. The construction of a D-sequence from two Q-convex sets that have the 
same X-rays. (D = {(, 0), (0, 1), (1, 1); (1, -1)) 


3.5 Complexity Result 


Let D be a set of directions, and $ be a class of lattice sets. Now we introduce 
the reconstruction problem for 6 and D. 

RECONSTRUCTIONg(D) 

Input: A map f : D x Z — No with finite support. 

Output: A set E € B, if it exists, which satisfies 


X E(k) = f(p,k), for all (pk) €D xZ. (3.9) 
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This section focuses on the reconstruction problem for the class Q(D) of Q- 
convex lattice sets w.r.t. D. 


3.5.1 Reconstruction from Exact X-Rays in Two Directions 


In this subsection the set of directions D consists of two directions: p = (a, b) 
and q = (c,d). 

If M is a point of Q? such that p(M) = i and q(M) = j, we represent 
it by (i, j) p,g. (We sometimes omit the index, when the context is clear.) Let 
us emphasize that (i, j)p,q is not always a lattice point. More precisely, there 
exists & € {0,...,5—1} such that (i, j)p, € Z? if, and only if, j = Ki (mod ô), 
ô denoting | det(p, q)| = |ad — bc] (see, for example, [12]). 

We are going to describe a polynomial-time algorithm that solves 
RECONSTRUCTION Q({p,q}) (1p; 4)). Given a function f in input, let pmin = 
min{k | f(p, k) > 0}, pmax = max(k | f(p, k) > 0}, amin = min(k | f(q, k) > 
0}, gmax = max{k | f(q, k) > 0). We can easily compute the grid 


g — (M € Z? | pmin < p(M) < pmax, qmin < GM) < qmaz} — (3.10) 


in polynomial time by the input data. Notice that any solution to the recon- 
struction problem is a subset of G and has the cardinality: 


s= YO fei)= P», fei. (3.11) 


pmin<i<pmax qmin<j<qmax 


If this last equality is not satisfied by f, then there is no solution to the 
reconstruction problem, so we can assume it. 

Let a and be two subsets of G, and let us denote the set of all elements 
E € O({p, q}) that satisfy (3.9) and a C E C B by E(a, B). The reconstruc- 
tion problem consists in determining whether £((), G) is empty or not, and in 
reconstructing a member of it in the latter case. 

In the algorithm we use two variable sets a and 0 satisfying a C 8 CG 
and such that E(a, 3) is invariant. The general sketch of the algorithm is the 
following: 


1. We choose two points U1,Us € G such that p(U1) = pmin and p(U2) = 
pmax, and the set a is initiated to ao = (U1, U2}. These points are called 
the p-bases. 

2. We compute a set Bo such that £(ao, €) = €(ao, Go) and each p-line p = i 
has less than 2f (p, i) points of 8o. The variable set £ is initiated to ĝo. 

3. We iterate some “filling operations" that add points to a and delete points 
to B but let E(a, 8) invariant. 

4. Finally, the special form of the obtained o, 3 permits us to use a 2-SAT 
algorithm to compute an element of £(o, 3) if the latter is not empty. 


Because the choice of the p-bases, which is done in step 1, is completely 
arbitrary, steps 2, 3, and 4 must be repeated for all the possible positions of 
the p-bases. 
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The rest of the section is devoted to the detailed description of steps 2, 3, 
and 4 of the algorithm. 


Computation of the Initial Set Go 


At this step we suppose that ag = (U1, U2} C G, p(U1) = pmin and p(U2) = 
pmax and we will compute a suitable Bo such that E(ao, 8o) = E(ao,G). We 
also make the hypothesis that q(U1) < q(U2). (The case q(Ui) > q(U3) would 
be studied in an analogous way.) 

First, we present some lemmas that give sufficient conditions for a quadrant 
to contain a point of any Q-convex lattice set E that satisfies (3.9). 

For this we define the four partial sums: 


S)= ^ f(»i S= M, fei); 


pmin à! «à ii <pmax 


(3.12) 
S()- MM, fas); BA= M, fas; 
qmin&j'«j j&€j' qmaz 
for i = pmin,...,pmax and j = qmin,...,qmaz. If M is a point of Q?, 


Sx ( M) will denote Sy (p( M)) if k = 0,2 and Si (q(M)) if k = 1,3. These sums 
satisfy the following easy but fundamental lemma: 


Lemma 4. Let M = (i, j)p,4 with i,j € Z (notice that M is, in general, in 
Q?). If S,(M) + Sy41(M) > S, then EN Z(M) # 0 for any E € €(0,9), 
where k = 0,1,2,3, and k+ 1 — 0 for k — 3. 


Proof. At first we take k = 0 into consideration. If EM Zo(M) = 0, then 
So(M) + $3(M) = |EN (Za(M) U Z1(M))| € S. Cases k = 1,2,3 can analo- 
gously be proven. 














For each i such that f(p,i) » 0, we can define two q-indices, as follows: 


a; = min{j | $1(j) + S2(i) > S}; (3.13) 


Lemma 5. If f(p,i) > 0, then a; € bi, for i = pmin,...,pmaa. 


Proof. By (3.13) we have that S1(a; — 1) + S2(i) € S. Since S1(a; — 1) = 
S — Ss(a;) and S3(i) = S — So(i — 1), the inequality can be rewritten as 
S3(a;) + Soli — 1) > S. If f(p, i) > 0, then So(i — 1) < So(i) and therefore, 
S'3(a;) + So(i) > S. In view of (3.14), this implies a; < b;. 

















Now we can define a third index cj as follows: 
(a) If aj; < q(U1), then c; = q(U1). 
(b) If q(U1) < Qj x bj < q(U2), then Ci = Qi. 
(c) If b; > q(U2), then c; = q(U2). 
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Let C be the point (i, ci) pq. 
Lemma 6. Let E € £((U1, U3), GV. If f(p,i) > 0, then 


Zi(C) NE 4 0, Vk € {0,...,3}. (3.15) 


Proof. (a) If a; < q(U1), we have C = (i, q(U1))5,4 and so U1 € Zo(C)NZ3(C) 
and U2 € Z2(C) because q(U1) < q(U2). Moreover, by the definition of a; 
it follows that S1(C) + S2(C) > S and then, by Lemma 4, we conclude 
that Zi(C) EF (. 

(b) If q(U1) <a < bi SS q(U»), then C — (i, Qi) p,q: So, U, € Zo(C) and 

Uz € Z2(C). By the definition of a;, S1(C) + S2(C) > S and therefore 

Zi1(C) n E z 0. Finally, we use the fact that ¢(C) = a; € b; and $3(C) + 

So(C) > S to conclude that Z3(C) n E z 0. 

If b; > q(U3), then we have C = (i, ((U2)) 5,4. It follows that Uz € Z1(C) 

Z2(C) and U, € Zo(C). By the definition of bj, S3(C) + So(C) > S and 

so Za(C) n E z 0. 


— 
fo) 
XA 














If C is a point of Z?, Lemma 6 proves that C belongs to E, and we could add 
the point C to the set o. It is in fact the strategy used for the reconstruction 
of hv-convex polyominoes from horizontal and vertical X-rays in [14]. But 
for general directions, C can be outside Z?, and we must consider the set 8 
instead of o. Let us define: 


Bo = (05,3) | F(p, i) > 0, F(q, j) > 0 and ci — ôf (p, i) < j € APU " 
3.16 


Lemma 7. €({U1,U2},G) = £((U1, U2}, Bo). 


Proof. Let E € £((U1, U2},G) and i € (pmin,..., pmax} such that f(p,i) > 
0. Define c; = max(j = «i (mod ô) | j € ci, A= (i cj)p,, and B = (i, c + 
0)p,; we have c; € c; € c; +ô and A, B € Z?. We are going to show that 
E {A,B} 4 0. Lemma 6 states that Z;(C) Nn E z 0, for any k. Since 
f (p, i) > 0, there exists a point N € E such that P(N) = i. 


(a) If q(N) € q(C), then N € Z(A) n Z1(A). We have Z2(C) 
Z3(C) € Z3(A), and therefore Za(A) N E # 0, Z3(A)N EF 
Q-convexity of F, we deduce A € E. 

(b) If q(N) > q(C), then N € Zo(B)NZ3(B). Since Zo(C) € Zo(B), Z1(C) € 
Zi1(B), by the same arguments as above we can conclude that B € E. 
So (A, B} n E 7 0, then {j | (i,j) € E} C [q(A) — 9((p.3) — 1),4(B) + 

ó(f(p,i) — 1)] € [ci — óf(p,i) + 1,c; + ôf(p,i)]. As this property is true for 

any i such that f(p,i) > 0, it proves that E € E({U1, U2}, Bo}. 


(4), 


C Z2 
(. By the 














By the definition of 39, we also have: 
Lemma 8. |{M € 4 | (M) = i| < 2/ (p.i). 
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The Filling Operations 


An operation that transforms a, into a’, 8’ such that E(a, 8) = Ela’, 8) 
and a Ca’ C f C f is called a filling operation (see [4, 14, 22]). 
In this section we define some filling operations that will be iteratively 
applied to a = (U1, U2} and 8 = Go, where bo satisfies Lemmas 7 and 8. 
Denote the set of points of 8 on the ith p-line (jth q-line) of G by 85 (81), 
and the set of points of a on the ith p-line (jth q-line) of G by aj, (af). We 
also define 


l(a5) = amin, (M), r(oj)— max q(M), (3.17) 
(A5) = min (M), r(8;) = max q(M) . (3.18) 


We list here the four operations ©,@,©,© already described in [4], adapted 
to any direction p. 


(a) If o5 # 0, then Gai = {(i, j) | (o2) € j € r(o2)]- 

(b) Bah = {li 3) | (8) - f (p, i) < j < UG) tòf} 

(c) If ap # 0, 6,3) € 6, with jx l(o5), then Of, = {(i,j) € bp |j jj. 
If o? £0, (i, 9") € bp with j' > r(o5). then o8, = ((55) € Bp | 3 < j')- 

(d) Ifa # 0, then 68, = {(i, j) € Bp | r(o5) 0f (p.i) < j < Uap) tif (p, i)}- 
To these four operations we add a further operation denoted by ©’, which 


allows us to delete in 3 a sequence of consecutive indeterminate points of the 
ith p-line, when the sequence is shorter than f(p, i): 


obi = BiN U  (&525e8ly«i«3). (8.19) 
(3^) (G3 ) eZ2NB 
0j" —j' <6 f (p.i) 

The filling operations on the g-lines are defined analogously. Figure 3.5 
illustrates an application of these operations. 

'The algorithm performs all these operations on the p-lines and on the q- 
lines, and repeats this procedure until a É 8 or no further changes in a and 
B are produced. If we obtain a Z 3, then E({U1, U2}, Z2) = Ø. Therefore, 
the algorithm chooses two different p-bases and tries again. If a = 0, then 
£((U1, U2}, Z?) = £(o, 8) C {a}, and so, in order to conclude, it only remains 
to check if œ is Q-convex. Besides we can obtain the case in which a and 8 
are invariant with respect to the filling operations and a C 6, so that 8 \ a is 
not empty. 


The Types of Lines 


In the previous section we have obtained two sets a, 3, which are invariant by 
the filling operations and which satisfy: 


{U1,U2} Ca C BC Bo. (3.20) 
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Fig. 3.5. The filling operations. 


It is possible to show that a and ( have very particular forms on the p-lines 
and q-lines. 
More precisely, the line p — i is of type 


(a) to if b; = 0, 

(b) t if o, #0; then we have: 
ap = (06,3) | (05) € j € r(05)] (3.21) 
Bp = (65,3) | (05) — C (p, i) — lagl) € 3 € (o5) + 80.3 — le ; 


(c) t2 if af = 0 and Ø} is composed of 2/(p, i) consecutive points. So we have 


B5 = (53) | Bp) € j < 1055) + 26f(p,2)) ; (3.22) 


(d) ts if at, = Ø and GF consists of two separated sequences of f(p, i) points. 
So: 


B, = (531 (B5) <7 B5) + 8(F(, 4) -1) or 
r(B) — (f(p, i) - 1) < j € r(85)] (3.23) 


and 
r(85) — 1(85) 2 26f(p, i). (3.24) 
Since we know that 8 C £o, thanks to Lemma 8 we can claim that: 


Proposition 2. After performing the filling operations, every p-line is of type 
either to or ty or to. 
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By Proposition 2, we deduce that 
KA = 2f(p,i)— [24 for allt = pmin,...,pmaz . (3.25) 
By summing over i, we have 
|G| = 28 — |a] . (3.26) 


Consider now the q-lines, and suppose that the jth q-line contains indetermi- 
nate points. Thanks to the operations & and ©’, we have 


|G3| = 2f (a. 3) — laž], (3.27) 
and, therefore, 
I] = SCG = J Eaj) — lo = 25 - lel . (3.28) 
j J 
By (3.26), we deduce 
KA =2f(q,j)— laž] for all j 2 qmin,...,qmaz . (3.29) 


We note that this result permits to determine the type of the q-lines. Indeed, 
when |a2| > 0, we know that the ith q-line is a line of type tı. If [o2] = 0, 
then we have 183| = 2f(q,j); thanks to the operation ©’, this means that 
the set Bi is made up of two sequences having the same length, being either 
consecutive (in this case the ith q-line is of type t2) or separated (in this case 
the ith q-line is of type t3): 


Proposition 3. After performing the filling operations, every q-line is of type 
either to or ty or t2 or ts. 


Reduction to a 2-SAT Formula 


In the previous subsections we have found two sets oa and 8 such that 
E((U1, U2}, Z?) = E(a, 8), and every p-line and q-line is of type t;, i = 0,...,3. 
We are going to see that this last property allows us to find an element of 
E(a, 8) by reducing the problem to 2-SAT. We present here a new assign- 
ment of variables to points that permits us to have a lower complexity for this 
reduction to 2-SAT than in [6]. 

We first recall the 2-SAT problem. Given n, Boolean variables Vi,..., Vn,, 
a 2-SAT formula is a conjunction of clauses, where each clause is a disjunction 
of two variables or negations of variables, so it has the form 


Ne 


N (Wax-1 V Wak), (3.30) 
k=1 
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where ^ (respectively, V) denotes the and (respectively, or) operator and Wx 
is a variable V; or a negation of variable Vi. In fact, a clause of a 2-SAT 
formula can also be a single variable, a negation of variable, an implication, 
or an equivalence of variables because we have the equalities: W — (W v W), 
(W 2 W’) = (Wv W’), (W SW’) =((W => W')A(W’ = W)). The 2-SAT 
problem is the following. 

Input: A 2-SAT formula F of the form (3.30). 


Output: If it exists, an assignment of the variables V; satisfying F. 


Theorem 4 ([1]). 2-SAT problem can be solved in O(ne + ny)-time, where 
ne is the number of clauses of F, and ny is the number of Boolean variables 


of F. 


We will use this result to solve our problem. For it, we associate five Boolean 
variables to each point M of G. 


(a) W(M) expresses the presence of M in the final solution. Hence W(M) is 
true if M is a point of the solution. 

(b) V;(M) expresses the presence of points of the final solution in Z? (M), for 
i = 0,...,3. Hence V;(M) is true if the intersection of the solution with 
ZP* (M) is not empty. 


Each instantiation of the Boolean variables W (M) gives aset a C F(W) C 
B where 
F(W)={M € G| W(M) = true}. (3.31) 


Now we construct a 2-SAT formula F whose variables are (W(M))meg and 
(V;(M))i=0...3,meg such that the truth of F implies that F(W) is a Q-convex 
lattice set having the given X-rays. 

Now we describe the different steps of the construction of this formula. 


Coherence 
As we want a C F(W) C 8, we impose that the following variables are true: 


W(M) for M €q; (3.32) 
WOD for M¢ 8. (3.33) 


Besides, for every M and N such that M € Z;(N) as Zi(M) € Z;(N), we 
must express the formulas 


Vi(M) => VN). (3.34) 


Formulas (3.34) can be imposed by the O(n?)-long following formulas: 
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N VolM) > Vol); 
M,NEG 
Bp(N)—5<p(M)<p(N) 
q(N)—5<q(M)<q(N) 


A vil) > V); 


M,Neg 
B(N)-ó2p(M)ZB(N) 
q(N)—ó€q(M)Eq(N) 


A — Va(M) = VN); 


M,Neg 
B(N)—ó2p(M)ZB(N) 
q(N)—ó2q(M)ZQ(N) 


N V3(M) = V3(N) . 
M,NEG 


B(N)—6<p(M)<p(N) 
q(N)—ó2q(M)Zq(N) 


(3.35) 


Additionally, since if M is in F(V), then Z;(M) contains a point of F(V), 
we also impose the following formulas: 


W(M) = V(M). (3.36) 
Formulas (3.35) and (3.36) imply that 


V WN) | sv), (3.37) 
NEZ;(M) 





and so V;(M) implies F(W)nZ;(M) = 0. (Notice that the converse is generally 
not true.) 


Expression of the Q-Convezity 


Now we impose that the set F(W) is Q-convex. We could express it directly 
by Vo(M)AVi(M) ^ V2(M) ^ Va( M) = V(M), but it is not a 2-SAT formula. 


Remark 1. Let M = (i,j)p,q be a point of Z? \ o that verifies one of the 
following properties: 


(a) d = j (respectively, p = i) is a tı line or a tə line. 
(bg =j is a tz line such that r(92) — o(f(q,j) — 1) € i or i € I(82) + 

o(f(q, j) — 1). 

Then, one of the two semi-lines Aj (M) = ((7,j) | i’ € i} and AZ (M) 
(6,3) | E > a} (respectively, A; (M) = (4,5) | j < j} and AFM) 
(0,3) | 7’ > j}) contains a point of E for any E € E(a, p). We denote 
this semi-line by A,(M) (respectively, A;(M)). Indeed, under the previous 
hypothesis: 


(a) if the line is of type tı, then A;(M) is the semi-line containing a point of 


al; 
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(b) if the line is of type t2 or t3 and M € B, then A,(M) is the semi-line 
containing all the points of 57; 

(c) if the line is of type t2 or t3 and M € 8, then we have A; (M)N Aj (M) = 
(Mj C B] and |B7| = 2f(q, j). So, one of the semi-lines verifies |A,(M) N 
B2| > f(q, j). This semi-line contains at least one point of any E € E(a, 8). 


As a summary, if 1(67) - ó(f(q, j) — 1) 2 i, then A,(M) = AT (M), whereas 


if r(82) — 6(f (q, j) — 1) < i, then A,(M) = A; (M). 


Now we can express the Q-convexity of F(W) around any M € G by means 
of a 2-SAT Boolean formula. 


(a) If M € a, then there is nothing to express. 
(b) If M ¢ 8, let Zi, (M) and Z4 (M) contain neither U; nor U2. Then the 
Q-convexity is simply expressed by 


V4 (M) V V5 (M). (3.38) 


(c) If M € B N a, then we are under the hypothesis of Remark 1 and so 
only one quadrant Z;(M) exists that contains neither Ap(M) nor Aq(M). 
Therefore, Q-convexity is expressed by: 


Vi(M) > W(M). (3.39) 
Expression of X,E(j) = f(a, i) 


Fix the q-line g = j. This line is of type t; with i € {0,...,3} and, so, there 
are exactly 2(/(q, j) — |a% |) unknown points on jth q-line. If the line is of type 
tı or t2, and A = (i, j)p¢q,B = (i + ôf (p, i), j)p,q € GB NV Q, then for any set 


E € E(a, B), we have 
A € E if, and only if, Bé E, (3.40) 


and so we can express X,E(j) = f (q, j) by the formula 


EQ;2- A  W((3)eW(Q^.3). (3.41) 
(5,3), ,3)€8Na 
i —i=6 f (4,3) 


If the line is of type ts, then this line is made up of two sequences of consecu- 
tive indeterminate points. Since we know that each set E € £(o, 3) contains 
exactly one of these sequences, in this case we can express X,E(j) = f (q, j) 
by the formula 


A Wi e» WUD), (3.42) 
(i3), ,3)€8No 
i'—i»óf(q3) 


which is equivalent to the shortest formula 
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EQ; = (WUBI, I)  W(r(0).3))) 


^ A WEDS WEF 4,5))) (343) 
(4,3), (i+ô, j) €8No 
In the same way we can express that X,E(i) = f(p,i) by a similar formula 
EP. 


Final Reduction 


Let F be the conjunction of the formulas (3.35), (3.36), (3.38), (3.39), EP,, 
EQ;j. By construction, we have 

Lemma 9. If E € E(a, 8), then W(M) = “M € E" and V;(M) = "Zi(M)n 
E 4 (" satisfy F. Conversely, if W(M),V;(M) satisfy F, then F(W) € 
E(a, B). 

We can use the algorithm of Theorem 4 to check if E(a, 8) is not empty, and 
if this is the case to compute a solution of the reconstruction problem. 


Complexity 


'The complexity of the algorithm depends on the cost of the following steps. 


(a) There are O(n?) choices for the base points. 

(b) The computation of 39 can be done in O(n?)-time. 

(c) The filling operations can be implemented in O(n?) (see [6, Section 6.1]). 

(d) The reduction to 2-SAT takes O(n?)-time, since the formula F has the 
form (3.30) with n, = O(n?) and ne = O(n?) . 

(e) Finally, the assignment of the variables can be done in O(n?)-time by 
Theorem 4. 


Theorem 5. RECONSTRUCTION Q((5,4) (1p. q)) can be solved in O(n?)-time, 
where n = max(pmax — pmin + 1, qmaxz — qmin +1). 


Remark 2. In [19] the author gives an algorithm that applies the filling op- 
erations €, 6, $, O until invariance in time O(n? log(n)). Unfortunately, this 
algorithm cannot be easily extended to the operation C/. 


3.5.2 Reconstruction from Exact X-Rays in More Than Two 
Directions 


'The reconstruction algorithm for Q-convex lattice sets w.r.t. two directions 
can be easily extended to any set D of cardinality greater than two. In this 
case, the solutions to the problem are subsets of the grid 


g-(MeZ'|vpeD min(k| fk) »0) SPM) € — (3.44) 
max(k | f(p,k) > 0} ) . 
We list here the steps to be performed. 
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(a) We choose the direction minimizing max{k | f(p, k) > 0}—min{k | f (p, k) 
> 0}. Suppose it is p: We fix the p-bases U1, Us. We also fix another 
arbitrary direction q. 

(b) We compute the set y exactly by the same method described above. So 
we only consider the data corresponding to the directions p and q in f. 

(c) We apply the filling operations for all the directions of D. At the end, all 
the lines parallel to any direction of D \ p are of type ti, 4 = 0,1,2,3 by 
Proposition 3, and p-lines are of type to, t1, t2 by Proposition 2. 

(d) The problem is reduced to 2-SAT by using 1 + 4(%) variables for each 
point M: W(M) and Vj? * (M) for i = 0,...,3, qi, q2 € D. The formulas 
are almost the same, but we must consider all the couples of directions. 
'The only formulas that cannot be generalized to any couple of directions 
are the formulas (3.38). So suppose that M is a point of G \ 3. We have 
to express the Q-convexity around M w.r.t. two directions qi,qo. There 
is at least a quadrant Z7'?^(M) that contains U or U2, and hence it 
cannot be empty. Consider all the qı-lines and qo-lines that do not pass 
by this quadrant. If one of these lines has a strict majority of its unknown 
points in a quadrant, then we have found another quadrant that cannot be 
empty. So in this case we can find 7; and i9 such that (3.38) expresses the 
Q-convexity around M. In the other case, it is easy to see that the formula 
VAP (M) & Vj? (M) holds, and so we can express the Q-convexity by 

(TEPOV VARON) A (VrO v VAF): — Ga9 


Theorem 6. RECONSTRUCTIONo(p)(D) can be solved in O(n?)-time, where 
n = maxyep(max(k | f(p, k) > 0} — min{k | f(p, k) > 0J). 


Remark 3. RECONSTRUCTIONQ((5,4)nz(p)(D) can also be solved in O(n?) 
time. Indeed, we can generalize the algorithm to reconstruct sets that are only 
Q-convex in two directions p, q and line-convex w.r.t. the other directions. To 
this goal, the set Go is computed using the directions p and q, and the filling op- 
erations are executed exactly as before, since they only use line-convexity. For 
the reduction to 2-SAT, we use five variables by point W(M) and V7" (M), 
and we can express Q-convexity in the same way, whereas we express line- 
convexity on the line ř = k by adding the formulas W((i, k—0)5,.) > W((i, k)) 
for k < g(8F) + f(r, k) and W((i, k + 6)) > W((i,k)) for k > r(85) — f(r, k). 


3.6 Further Remarks 


In this section we present some remarks about complexity results that can be 
obtained by the algorithm described in the previous section. 

First we show that the algorithm can be applied to solve a more general 
problem, and then we see how it generalizes some known algorithms previ- 
ously designed for the reconstruction of special classes of lattice sets having 
convexity and/or connectivity properties. 
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3.6.1 Other Algorithmic Problems 


Consider the following problem: 

PRESCRIBED_RECONSTRUCTIONQp)(D) 

Input: A map f : D x Z— No with finite support, and two sets A and B. 

Output: A set E € O(D), if it exists, that satisfies A C E C B and (3.9). 
'To solve this problem, we can use the algorithm of Section 3.5 with a little 

modification: The initial a and 8 are settled to (U1, U3) UA and Bor B, which 

results in 


Theorem 7. PRESCRIBED. RECONSTRUCTIONo(p) (D) can be solved in O(n?)- 
time. 


Notice that the corresponding problem for the class of all the lattice sets 
is NP-complete when |D| > 3, even if A contains almost all the points of the 
solutions ([18, Theorem 5.1]). 

We can also consider the following algorithmic problem: 
UNIQUENESSQ(pj)(D) 

Input: A lattice set E € O(D). 

Question: Does there exist another set E' € Q(D) that has the same X-rays 
in D? 

'The corresponding problem for the class of all the lattice sets has been 
proved to be NP-complete for |D| 7 3 ([18, Theorem 4.3]). In Section 3.4 we 
have characterized the sets of directions of uniqueness, that is, for which sets 
D the answer to UNIQUENESSoQ(p)(D) is always negative. For the other sets 
of directions, the answer depends on the X-rays of the set E in input. We can 
give the following result. 


Theorem 8. UNIQUENESSg(p)(D) can be solved in O(n9)-time, where n = 
maxpep(max{k | X, E(k) > 0) — min{k | X, E(k) > O}). 


Proof. Consider a set E € Q(D) and choose an arbitrary direction p € D. Let 
G be defined by (3.44). If there exists a set E' that has the same X-rays in 
D as E, then there is an integer k such that E {p = k} and Fn (p = k} 
are different. Hence to solve the problem, we only have to apply the following 
algorithm: 


(a) For all k such that X, E(k) > 0, do the following: 

a) Let I, J be the endpoints of EN (p = k}. 

b) Apply an algorithm solving PRESCRIBED_RECONSTRUCTIONg:p)(D) 
with f(p,i) = X,E(i) for all (p,)) € Dx Z, A=0, B = G \ {I}. If the 
algorithm finds a solution, then return the answer “yes.” 

c) Same as above with I replaced by J. 

(b) Return the answer “no.” 











By Theorem 7 the complexity of this algorithm is O(n9). 
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3.6.2 Reconstruction of Lattice Sets with Line-Convexity and 
Connectivity Constraints 


Consider the problem of reconstructing hv-convex 8-connected lattice sets 
from the horizontal and vertical directions. 


Lemma 10. A lattice set E is hv-convex 8-connected if, and only if, the two 
following assertions are true: 


1. E is Q-convez w.r.t. (hv). 
2.{k € Z| X,E(k) > 0) and {k € Z| X, E(k) > 0] are sets of consecutive 


integers. 


Proof. 'l'he direct sense is a consequence of Proposition 1. 

Conversely, suppose that E satisfies conditions 1 and 2. By Proposition 
1 we know that E is hv-convex. Let A = (x4,yA) and B = (xp,yp) be 
two points of E. We suppose that r4 < rpg and ya < yp (the other cases are 
similar). Let Ay = A+(0,1), A? = A+(1,0), As = A+(1,1), and suppose that 
none of these points is in E. By Q-convexity we have Z3"(A1)M E = 0, and 
Z¥"(Ag)ME = 0. Moreover, Z2”"(A3)NE = 0 or Z?"(A3)NE = 0, but the first 
case implies X, E(x 44-1) = 0 and the second one implies X; E(ya +1) = 0. So 
one of the points A1, A», A3 isin E. So, by a recurrence on (zp—z4)--(yp —yA) 
we can prove that there is a 8-path from A to B. 














Since condition 2 concerns the X-rays only, by Theorem 5 it follows that: 


Theorem 9. If D is a set of directions containing the horizontal and vertical 
directions, then RECONSTRUCTIONp,nz(p)(D) can be solved in O(n?)-time. 


'To extend this result to 4-connected lattice sets, we need the following lemma: 


Lemma 11. A lattice set E is an hv-convez polyomino if, and only if, E is 
hv-convex 8-connected and for any integers j and k such that X,E(j) > 0, 
X,E(j +1) » 0, X,E(k) > 0, Xn(k+1) > 0, we have that E is not a subset 
of either Zà^ (3, k) U Z3” (j - 1, k - 1) or Z(G 4- 1, K) U Z^ (3, k +1). 


Proof. The direct sense is clear. 

Conversely, suppose that E satisfies the conditions about the quadrants. 
If E is not 4-connected, it means that there exist two points A and B that 
are not connected by a 4-path. As E is 8-connected, we can suppose that the 
difference A — B is in {(+1,+1)}. Suppose, for example, that B = A+ (1,1), 
but A + (1,0) and A+ (0,1) are not in E; we have E C Zj^(A) U Z3"(B), 
which contradicts the hypothesis on E. 


























Theorem 10. /f D is a set of directions containing the horizontal and vertical 
directions, then RECONSTRUCTIONp,nz(p)(D) can be solved in O(n?)-time. 
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Proof. Let f be the function given in RECONSTRUCTIONp,,c(p)(D). We can 
suppose that {k € Z| f(z, k) > 0) and {k € Z| f(y,k) > 0) are made up of 
consecutive integers, since otherwise the problem has no solution. 

Then we can apply the algorithm outlined in Remark 3, where we add the 
formulas 


Vi +1, k) V Va(j, k + 1), Vo, k) V VIG +1, k + 1) (3.46) 


to the final 2-SAT formula F. 

We must check that if the truth assignment to variables W satisfies F, 
then F(W) is a polyomino. As there is no horizontal or vertical lines with 
null X-ray, for any point M € G such that W(M), exactly one V;(M) is set 
to false. It corresponds to the only quadrant such that Zi(M) n F(W) z 0. It 
follows that V;(M) is set to true if, and only if, F(W) n Zi(M) # 0, and, by 
the previous lemma, F(W) is a polyomino. 

















Lemma 12. A lattice set E is line-convez w.r.t. (h,v,d) and 6-connected if, 
and only if, the following assertions are true: 


1. E is Q-convex w.r.t. {h,v, d}. 
2. {k € Z| X, E(k) > 0}, {k € Z| X E(k) > 0), {k € Z| XaE(k) > 0} are 


sets of consecutive integers. 


'The proof of this lemma is left to the reader, as it follows the same ideas as 
the one of Lemma 10 (see [10, Proposition 2.5.10]). 
We deduce the following: 


Theorem 11. /f D is a set of directions containing the horizontal, vertical, 
and diagonal directions, then RECONSTRUCTIONp,nr(py(D) can be solved in 
O(n?)-time. 


Remark 4. 'The algorithms presented [3, 8] also solve the problems of Theorems 
9, 10 and 11. The complexity given in these papers is O(n* log(n)), but these 
complexities were based on the hypothesis that the operation ©’ is applied 
only one time. This cannot be supposed in the general case. 


Remark 5. In [9] the authors give an algorithm that solves 
RECONSTRUCTION7,nz((A,v)) (L^, v}) in O(n*) time. So it has a better worst- 
case computational complexity than the algorithm presented here. T'he algo- 
rithm can also be generalized to Q-convex lattice sets (see [7]) and more than 
two directions, but its complexity grows exponentially with the cardinality 
of D. A comparison of the algorithms presented in [8] and [9] is given in [2], 
where the authors show that the algorithm of [8] is better from the viewpoint 
of average time complexity and memory requirements. 


Remark 6. In [14, Section 7.3] are collected intractability results for the re- 
construction of the polyomino and/or convexity property. 
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3.6.3 Reconstruction of Convex Lattice Sets 


By combining Proposition 1 and Theorems 3 and 6, we obtain 


Theorem 12. /f D is a set of directions containing four directions whose 
ordered cross ratio is not in {4/3,3/2,2,3,4}, then RECONSTRUCTION¢(D) 
can be solved in O(n?)-time, where n = maxpep(max{k | f(p,k) > 0) — 
min{k | f(p, k) > O}). 


We still do not know the computational complexity of RECONSTRUCTION¢(D) 
when D does not uniquely determine C and, for example, when D consists of 
the horizontal and vertical directions. 


3.6.4 Practical Considerations 


If we want to solve RECONSTRUCTIONQ(pj(D) in practice, it seems that the 
choices of the base points are often unnecessary (see [10, Annexe B], [2] for 
the two-directions case). 

Indeed with three directions or more, the filling operations and an excep- 
tional use of 2-SAT (when there are more than one solution) seem to be almost 
always sufficient, but no theoretical result justifies this fact. 

In practice, reconstruction from approximate X-rays must also certainly be 
considered. There are theoretical results about reconstruction of sets that have 
a convexity property from approximate X-rays ([20, 7]). The algorithms are 
straightforward extensions of the exact case of [9]: [20] considers the case of two 
directions, while [7] works with more directions, but the obtained theoretical 
and experimental time-complexities are worse than in the exact case. 
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A 


Algebraic Discrete Tomography 


L. Hajdu and R. Tijdeman 


Summary. In this chapter we present an algebraic theory of patterns that can be 
applied in discrete tomography for any dimension. We use that the difference of two 
such patterns yields a configuration with vanishing line sums. We show by introduc- 
ing generating polynomials and applying elementary properties of polynomials that 
such so-called switching configurations form a linear space. We give a basis of this 
linear space in terms of the so-called switching atom, the smallest nontrivial switch- 
ing configuration. We do so both in case that the material does not absorb light and 
absorbs light homogeneously. In the former case we also show that a configuration 
can be constructed with the same line sums as the original and with entries of about 
the same size, and we provide a formula for the number of linear dependencies be- 
tween the line sums. In the final section we deal with the case that the transmitted 
light does not follow straight lines. 


4.1 Introduction 


One of the basic problems of discrete tomography is to reconstruct a function 
f: A — {0,1}, where A is a finite subset of Z” (n > 2), if the sums 
of the function values (the so-called X-rays) along all the lines in a finite 
number of directions are given. A related problem on emission tomography 
is to reconstruct f if it represents (radioactive) material emitting radiation. 
If f(z) = 1 for some i € A, then there is a unit of radiating material at 
i; otherwise, f(i) = 0 and there is no such material at i. The radiation is 
partially absorbed by the medium, such that its intensity is reduced by a 
factor 8 for each unit line segment in the given direction (with some real 
number 8 > 1). 

As an illustration we include an example. In Fig. 4.1 the row sums of 
f (the number of particles in each row, from top to bottom) are given by 
[4, 4, 2,5, 1,2], while the column sums (the number of particles in each col- 
umn, from left to right) are [2, 3,2, 1,2, 3, 2, 3]. Further, taking the line sums 
of f in the direction (1, —1), i.e., the sums of elements lying on the same 
lines of slope —1, we get (from the bottom-left corner to the top-right corner) 
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Fig. 4.1. The symbols e denote particles on a grid that are represented in the 
table f on the right by 1’s. In the classical case the light is going horizontally and 
vertically, resulting in row and column sums. In the emission case the particles emit 
radiation, which is partially absorbed by the material surrounding the particles. The 
intensity of the radiation is measured by detectors, denoted by [ signs. 


[0, 0, 1, 1,2, 3, 1,3,3,2,0, 2, 0]. Finally, suppose that the particles emit radia- 
tion in the directions (—1,0) and (0, 1). If 8 is the absorption coefficient in 
these directions, i.e., the absorption on a line segment of unit length is pro- 
portional with 8, then the “absorption row sums" (measured at the detectors) 
from top to bottom are 


D + BT? + 6T + 67T, ROT EP RET «qt. (4.1) 
BT + B7? + eae B75 + ges teg. (4.2) 

and the “absorption column sums” from left to right are given by 
Bree SERE Eu ee ee ee, qu 
B? + BT? + 678, 6T! + 074, BT? + iar ale (4.4) 
In the past decade considerable attention has been given to this type of 
problems; see, e.g., [6, 7, 15, 16] and especially [19] for a historical overview. 
Many papers investigate the problem under which circumstances the line sums 
determine the original set uniquely; see, e.g., [1, 8, 9, 11, 25] for the nonab- 
sorption and [20, 21] for the absorption case. However, in many cases more 
than one configuration yields the same line sums. Observe that the “differ- 
ence” of two configurations with equal line sums has zero line sums. Such a 
difference is called a switching configuration. In the case of row and column 
sums they were already studied by Ryser [23] in 1957. We refer to [17, 18] 
for the case of two general directions and for the investigation of so-called 
switching chains. Shliferstein and Chien [25] studied switching configurations 
in situations with more than two directions. Switching configurations play a 
role in solution methods of, e.g., [1, 13, 17, 18, 20, 21, 25]. Already Ryser [23] 
showed in the case of row and column sums that every switching configuration 

can be composed of simple switching components 
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g ES (4.5) 


An algebraic theory on their structure was developed by the authors [12, 14] 
based on switching components of minimal size, so-called switching atoms. In 
order to reconstruct the original itself, one can use additional known properties 
of the original object to favor some inverse images above the others, such as 
convexity (see, e.g., [1]) or connectedness (see, e.g., [3, 4, 13]). For an extensive 
study on the computational complexity of discrete tomographical problems, 
see [10]. 

In this chapter we describe a general algebraic framework for switching 
configurations. We collect and at certain points generalize some of our previous 
results. We show that our method can be applied to more general problems 
than only the classical ones in discrete tomography. We mention that, though 
we focus on Z” only, the results presented below can be generalized to any 
integral domain R such that R|z1,..., £n] is a unique factorization domain. 
We recommend the book of Lang [22] as a general reference for algebra. 

To formulate the above problems in a precise way, we introduce some 
definitions and notation that we use throughout this chapter without any 
further reference. Let n be a positive integer. The jth coordinate of a point 
v € Z” will be denoted by v; (j = 1,...,n), that is, v = (v1,..., Un). Let m; 


(j — 1,...,n) denote positive integers, and put 

A={ieEZ" |0<i; <m, for j=1,...,n}. (4.6) 
Let d be a positive integer, and suppose that Š are equivalence relations on 
A for k = 1,...,d. (For example, points are equivalent if they are on a line in 
some direction characterized by k.) Let H D ied HË ) denote the equivalence 
classes of ©. Finally, let oy : A — Ryo be so-called weight functions for 

d 
k= 1,...,d, and set o = J, op. Now the above-mentioned problems can be 
k=1 


formulated in the following more general way. 


Problem 1. Let cx; be given real numbers for k = 1,...,d and l =1,..., tx. 
Construct a function g : A — {0,1} (if it exists) such that 


NO Sou) cw (k=1,...d; Pag (4.7) 
ice g(? 


It is important to note that Eq.(4.7) is certainly underdetermined with 
respect to functions g : A — Z. Moreover, the same may be true for solutions 
g: A— {0,1}. For example, the function g given by 
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01101010 
10001101 

| 01000100 
J: 11110001 
00000010 
00000101 


(4.8) 


has the same row and column sums as f from Fig. 4.1. Consequently, h := f—g 
has zero row and column sums. Vice versa, having a function h: A — Z with 
zero line sums, the line sums of g+ h will coincide with those of g. It turns out 
that the study of switching configurations over Z is much simpler than that 
over {0,1}. It is therefore important to note that the solutions to Problem 1 
can be characterized as the solutions of the following optimization problem 
over Z. 


Problem 2. Construct a function g: A — Z (if it exists) such that (4.7) 
holds, and 


Yo) o(i) is minimal . (4.9) 
icA 


Remark 1. If g is a solution to Problem 1, then g is a solution to Problem 2. 
To show this, let f : A — Z be any other solution to (4.7). Then we have 


V^ 9(i)?0(i) = V s()e() = V, Fe) < V fü o). — (410) 


icA icA icA icA 


The idea used here, that a binary solution has small “length,” has been used 
in several papers; see, e.g., [3, 4, 13]. 


Remark 2. We also mention that when the equivalence relations & mean that 
the corresponding points are on the same lines in given directions, and the 
weight functions op are defined as certain powers of some real numbers jy > 1, 
then in view of Remark 1, our problems just reduce to the classical problem 
of emission tomography with absorption. In particular, when y = 1 (o; = 1 
for every k), we get back the classical problem on discrete tomography. 


As we indicated, we will study the structure of the set of integral solutions 
of Eq.(4.7). It turns out that in case of line sums there exists a minimal 
configuration (the so-called switching atom) such that every integral solution 
of the homogenized equation (4.7) (i.e., with cy; = 0) can be expressed as a 
linear combination of shifts of one of the switching atoms. For the case of row 
and column sums, the switching atom is 


itur (4.11) 


In this chapter we characterize and derive properties of switching configura- 
tions. 
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The structure of this chapter is as follows. In the next section we briefly 
outline the main principles of our method. In Section 4.3 we give a complete 
description of the set of integral solutions of (4.7) in case of the classical 
problem of discrete tomography, for arbitrary dimension (see Theorem 1). 
Theorem 2 shows that if Problem 2 admits a solution, then a relatively small 
solution can be found in polynomial time. In Section 4.4 we derive similar 
results for the case of emission tomography with absorption, also for any 
dimension n. Finally, in Section 4.5 we consider a new type of tomographical 
problems. Instead of lines, the X-rays (in Z?) are assumed to be parallel broken 
lines or parallel shifts of the graph of a function G: Z — Z. It turns out that 
our machinery is applicable in this case, as well. 


4.2 The Main Principles of the Method 


In this section we summarize the main principles of our approach. Our method 
relies on the following four fundamental observations. 


1) If both functions f,g: A — Z are solutions to Eq. (4.7), then the difference 
h:— f — g isa solution to (4.7) with cj; = 0 for all k,l, that is, to 


3o sitse) e. (iem Tod L=1,...t)- (4.12) 
ien(? 


So to characterize the set of integral solutions of (4.7), it is sufficient to 
know one particular solution g together with all the solutions of (4.12). 
2) Suppose that Hi,..., His a partition of A. Let f: A— Zand fj: Hı > 
Z (Il = 1,...,t) be given functions and write xr(z) = Y; f(i)zi for the 
icA 
generating polynomial of f. Suppose that xp (£) = >> fi(i)zi vanishes 
icH, 


t 
for | — 1,...,t, and that x(x) = $5 xp (x). Then x(x) vanishes. 
i=1 


3) If x f(z) is divisible by polynomials Pi(z),.... Ps(x) € Z{z], then x(a) is 
divisible by lem(Pi(z), ..., Ps(z)) in Ziz]. 

4) Let f be a solution to Eq.(4.12). Then in the cases investigated in this 
chapter we have x(x) = P(x)Q(z), where P corresponds to a “minimal” 
solution M to (4.12), and Q indicates which combination of the translates 
of M yields f. 


'To illustrate how these principles work, we exhibit some examples. 


Example 1 (row sums). Let n = 2, A= ((45,3) | O € à < mi, 0 € j < ma), 
and H; = ((4, 0D) |0 € i < mi) for l = 0,..., m2 — 1. Let f: A5 Z bea 
given function. Define fı: Hı > Z for l = 0,...,ma — 1 by fili, l) = f(i, D 
(i =0,...,m, — 1). Then 


60 L. Hajdu and R. Tijdeman 


m»-—1 mi-—1 


xeles y) = X xn(@,y) and xg(z,y) =y SO fii Da. (4.13) 
l=0 1=0 
m;i-—1 
(i) Suppose $7 f(i,l)= 0 for l= 0,..., ma — 1. Then 
1=0 
mi—1 mi-—1 
xu (y) - v! M, fi) — v M. FGI =0 forl=0,...,m2-1. (4.14) 
1=0 1=0 
Hence 
TH3 — 1 
xk -— SS fae y 2, fI (4.15) 
(EM 


m»-—1 


is divisible by x — 1 for l = 0,...,ma — 1. Thus xy(z,y) = > xg(x.y) is 
i—0 
divisible by x — 1. 
mi-—1 : 
(ii) Let 8 € C, and suppose that $5 f(i,0)8* =0 for l = 0,...,ma — 1. 


i=0 
Then 
mi-—1 . m;i-—1 : 
xan 2 v M5 ADE =y V. fi) —0 forl=0,... mi -1. 
1=0 1=0 


(4.16) 
Hence x y, (x,y) is divisible by z — over C for | = 0,...,m2—1. Then y f(z, y) 
is divisible by x — 8 over C. Since x(x, y) € Z[z, y], this implies that x; = 0 
if B is a transcendental number and that x +(x, y) is divisible by the minimal 
defining polynomial of 8 if it is an algebraic number. The above argument can 
be given for columns, as well. 


Barcucci, Frosini, and Rinaldi [2] treated the binary case (i.e., only coef- 
ficients 0 or 1) where the row sums are measured in both directions for the 
absorption coefficient 3 = (1 + /5)/2. They proved that in that case the row 
sums determine the configuration uniquely. Since it is a good illustration of our 
approach, we show how this conclusion follows from the above considerations. 
Suppose there are two distinct binary solutions. Then the polynomial f, de- 
fined as the difference of both characteristic polynomials, has only coefficients 
1, 0, and —1 and vanishing row sums into both directions. The polynomial 
m-—1 . 

5 f(i,))z’' is therefore divisible by both the minimal polynomial z? — x — 1 
i=0 

of p and the minimal Spoori x? +x — 1 of 67! for all l. Hence both 
= f (i, DB? = 0 and n f (i, 1)(—8)* = 0. By addition and subtraction we 


find that both >> fü.ngi = — 0 and Y; /f(i,1)8* = 0. Since the nonzero 
i even i odd 
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coefficients have modulus 1 and 8? > 2, the first nonzero term of each ex- 
pression exceeds the sum of the remaining terms. We conclude that all the 
coefficients of f are 0 and therefore the solution is unique. 


Example 2 (row and column sums). On combining Example 1 with 1 to the 
m;-—1 

row sums and with 65 to the column sums, we obtain that if S fG, DB 20 
i=0 

for | =0,...,m2—land T f(l, j) 63 = O for l =0,. .,m,—1, then xy = Oif 


=0 
B1 or 62 is PETET E and that otherwise x is divisible by the product of 
the minimal defining polynomials P (x, 1) of 81 and P2(1,y) of 85 (as Pi (a, 1) 
and P2(1,y) are coprime). 


Kuba and Nivat [20] studied the special case of row and column sums for 
fi = f» = (1+ y5)/2 (cf, Example 4 in Section 4.4.1). The situation of 
having different absorption coefficients in different directions was studied by 
Zopf and Kuba [26] in another context. 


Example 3 (line sums). Let n and A be as in Example 1 and a,b € Z. Without 
loss of generality, we may assume that a > 0. Suppose first that we have b < 0. 
Put Hı = {(i, j) | aj = b¢+1} for l = 0,..., m with m = (mi —1)b+ (ma — 1)a. 
Hence A is the disjoint union of the H;. Define the functions fı: Hı — Z for 
the above values of | by fi(4,3) = f(i, j) ((i, j) € Hi), where f: A— Zisa 
given function. Then 


x, ET (x,y) where x s (z, y) oe fili, j)z’y? . (4.17) 
1=0 (i,j)EH 


Let 8 € C, and suppose that $5 fili, j)8* =0 for 1=0,...,m. Then 
(EM 


xu(my)9 Y. AG, jayt e= y Y^ AG, jy" ^Y =0 (4.18) 


(ü3)€ Hi (Gj3)€ Hi 


for « = fy-"/* and | = 0,...,m. It follows that xy (By ^, y) = 0. Equiv- 
alently, x;(8y-^,y^) = 0. We conclude that y; = 0 if 8 is transcenden- 
tal and that otherwise x; is divisible by the minimal defining polynomial of 
g/d — g2/dy—b/4, where d = gcd(a, b), if 8 is algebraic. Similarly we find in 
case b > 0 that x; is divisible by the minimal polynomial of 2?/dy*/4 — $a/4, 


Combine Pape 1 with 8 = £1 and Example 3 with a = 1, b= —1, 8 
d . Suppose »» "fis DBÈ = 0 for | —0,..., M2 — 1 and p2 Fi 3)8; y2i 








—itl 
0 for l = E ‚Mı t ma — 2. Then x; is divisible by path polynomials z— fi 
and x — By? y over C. By the theorem of Gelfond-Schneider, we know that if 
£1 Æ 0,1, then By? is transcendental if 01 is algebraic. Hence either 3; = 0 
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and x; is divisible by x, or 3; = 1 and xy; is divisible by (x — 1)(z — y), or 
A ie 0. 

Combine Example 3 with a = 1, b = —1, 8 Æ 0 arbitrary and Example 
3 with a = b = 1, and ^! in place of 8. Suppose X /(i,j)0* = 0 for 

je-i 
1=0,...,mi+m2—-—2and Y; f(ij)8 ^ =0 for 1 = —mi  1,..., ma — 1. 
j=i+l 

Then x is divisible by both polynomials z— y and zy— 8^! over C. Hence x; 
is identically zero if 8 is transcendental. If 8 is algebraic, then x is divisible 
by the product of the minimal polynomials of x — Gy and zy — 0-1. 

Finally, combine Example 3 with a = 1, b = —1, B # 0 arbitrary and 


Example 3 with a = 1, b = —1, 8^! in place of 8. (The latter condi- 
tion is equivalent with a — —1, b — 1, absorption coefficient 8.) Suppose 

YS f(uj8 = M f(üj)8^? = 0 for |l = 0,..., mi + ma — 2. Then 
je-i je-i 


xf = 0 if B is transcendental. If 8 is algebraic, then y (zx, y) is divisible by 
the minimal polynomial of xy — 0, and, if the minimal polynomial of 8 is 
nonreciprocal, even by the product of the minimal polynomials of z — Gy and 
gy — Bly. 


4.3 Discrete Tomography in nD 


In [12] we developed a theory on switching configurations in case n = 2. In 
this section we generalize it to arbitrary n. 


4.3.1 Some Notation 


Let a € Z” with gcd(a1,...,an) = 1, such that a 4 0, and for the smallest 
j with a; # 0 we have a; > 0. We call a a direction. By lines with direction 
a, we mean lines of the form b + ta (b € R”, t € R) in IR". Let A be as in 
the Introduction. By the help of a direction a, we can define an equivalence 
relation on A as follows. We call two elements of A equivalent if they are on 
the same line with direction a. If g: A — Q is a function, then the line sum 
of g along the line T = b + ta is defined as Y; g(t). Note that the line sums 
ic ANT 
are in fact the “class sums" from (4.7), corresponding to the above-defined 
equivalence. 





We will work with polynomials F € Q[xi,...,2x,]. For brevity we write 
z = (z1,...,24) and z* = J] x (i € Z”). The generating polynomial of a 
j=l 


function g: A — Q is defined as 


x(a) = So oz. (4.19) 


ic A 
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A set S = {a,}¢_, of directions is called valid for A if x lanj| < mj for 


any j =1,...,n. Suppose that S is a valid set of directions. for A. Fora € $, 
put falx) = (x& —1) [I Sm and set Fs(x) = n fa, (x). Let 
aj «O0 


d 
U ={u|0< uj «m; - 3 las G =1,...,0)}. (4.20) 


For u € U, put F(u;s)(z) = x“ F(x) and define the functions M(y,3): A> Z 
by 
M(u;sy(i) = coeff(z*) in Fiy,s)(z) forie A. (4.21) 


The M(,;sys are called the switching atoms corresponding to the direction 
set S. By the minimal corner of the switching atom M(g;s), we mean the 
element i* € A for which Mg; s)(Z) z 0, but M(gsj(i) = 0 whenever i € A 
lexicographically precedes i*. That is, i* is lexicographically the first element 
of A for which the function value of M(o,s) is nonzero. It follows from the 
definitions of fa and Fs that 


Mio;sy(2*) uk]. (4.22) 





Since it corresponds with the minimal corner of M(o;sj, for every u € U we 
define the minimal corner of M(,;s) as i“ + u. Again, the minimal corner of 
M(y;sy is lexicographically the first element of A for which the function value 
of M(y;s) is nonzero, and we also have 





Mus (i +u) = +1. (4.23) 
It is clear that a function g defined on A can be considered as a vector (a 
i mj-tuple). If we want to emphasize this, we write g instead of g. We always 


ssi that the entries of these vectors are arranged according to elements of 


A in lexicographical order. The length of g (or g) is |g| = |g| = ./ > 9(i) 
icA 


4.3.2 The Structure of the Switching Configurations 


Our main result shows that every switching configuration is a linear combi- 
nation of translates of the switching atom M(g;sj. 


Theorem 1. Let A be as before, let S = {a,}¢_, be a valid set of directions 
for A, and let R be either Z or Q. Then any function g : A — R with zero line 
sums along the lines corresponding to S can be uniquely written in the form 


g= X Mus) (4.24) 


ucU 
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with some c, € R (u € U). Moreover, every such function g has zero line 
sums along the lines corresponding to S. 


Remark 3. As one can easily see from the proofs, if S is not valid for A, then 
the only function having all its line sums zero is the identically zero function 
on A. 


'To prove the theorem, we need the following lemma. 


Lemma 1. Assume that a is a valid direction for A, and let R be either Z 
or Q. Then a function g: A — R has zero line sums along the lines with 
direction a if, and only if, falx) divides v(x) in Rz]. 


Proof. We give the proof only when a; > 0 (j = L,...,n); the proof is similar 
in all the other cases. Put B = {b | b € A, b— a ¢ A}, and for b € B set 
Ip = max{t € Z| b+ ta € A}. Observe that we can write 


Ip Ty 
Xola) = V So g(b + tau? = X` fM g(b + ta)a* (4.25) 


bc B t=0 bceB —t—0 
Ia t—1 Io 

— (z& — 1) M 2X gb+ta) M 2272 - Y g(b+ta). (4.26) 
bcB — 1-0 s—0 bcB t20 


As falx) = x% — 1 and the line sums of g in the direction a are given by 
Ie 
35 g(b+ ta), the lemma follows. 
t=0 














Proof (of Theorem 1). By definition, for every u € U, the function F(,5; is 
divisible by fa, for any k with 1 < k < d. Hence by Lemma 1, M(,;s) has 
zero line sums along all the lines corresponding to S. This proves the second 
statement of Theorem 1. 

Now let 


H = {f : A — R | f has zero line sums corresponding to S} . (4.27) 


We first prove that the switching atoms generate H. Suppose that g € H. 
Lemma 3 (from Section 4.4) implies that the polynomials fa, (x) are pairwise 
nonassociated irreducible elements of the unique factorization domain R{z]. 
Hence by Lemma 1, we obtain 


Fs(r)| Xo(a) in Riz]. (4.28) 
Hence there exists a polynomial h(x) = $5 cuz” in R[x] such that y4(z) = 


ucU 
h(z)Fs(x). We rewrite this equation as 


xg(z) = >> cuFtus)(2) - (4.29) 
ucU 
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Now by the definitions of g(x) and the switching atoms M(y;s), we immedi- 
ately obtain 
g= M cM(us) 5 (4.30) 
ucU 
which proves that the functions M(,;s; generate H. 
Suppose now that for some coefficients |, € R (u € U) we have 


S luMu,s)(i) = 0 for alli A. (4.31) 
ucU 


By the definitions of the switching atoms, at the minimal corner of M(o;s; all 
the other switching atoms vanish. This immediately implies lọ = 0. Running 
through the switching atoms M(,;s) with u € U in increasing lexicographical 
order, we conclude that all the coefficients lu are zero. This shows that the 
switching atoms are linearly independent, which completes the proof of the 
theorem. 














'The following result is a consequence of Theorem 1. 


Corollary 1. Let A, S, and R be as in Theorem 1. Let C be the set of those 
elements of A that are the minimal corners of the switching atoms. Then for 
any f : A— R and for any prescribed values from R for the elements of C, 
there exists a unique g: A — R having the prescribed values at the elements 
of C and having the same line sums as f along the lines corresponding to S. 





Proof. As every switching atom takes value +1 at its minimal corner, we 
obtain unique coefficients c, € R (u € U) such that 


9:— f >> euM(us) (4.32) 
ucU 


has the prescribed values at the element of C. By the second statement of 
Theorem 1, the line sums of f and g corresponding to S coincide. 














4.3.3 Existence of “Small” Solutions 


We provide a polynomial-time algorithm for finding an approximation to f 
having the required line sums. We first compute a function q: A — Q having 
the same line sums as f in the given directions by solving a system of linear 
equations. Subsequently, we use the structure of switching configurations to 
find a function g : A — Z that is not far from q and f. The general result 
is given in Theorem 2. It follows that in case when f has (0,1) values, the 
algorithm provides a solution g : A — Z satisfying (4.7) with |g(i)| x 247! + 
1 on average, where d is the number of directions involved. The function 
obtained by replacing all function values of q greater than 1/2 by 1 and all 
others by 0 provides a good first approximation to f in practice. In [13] an 
algorithm is given that relies on this principle. 
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Theorem 2. Let A, d, and S be as in Theorem 1. Let all the line sums in the 
directions of S of some unknown function f : A — Z be given. Then there 


Tree 


g: A—Z such that f and g have the same line sums corresponding to S. 
Moreover, 


lal € |f] - 277! (4.33) 





TL 
II»; 
j=1 


d 
Proof Put N; = X}, |ax;| for j = 1,...,n. First, compute some function 
k=1 


q: A — Q having the same line sums as f. It can be done by solving the 
system of linear equations provided by the line sums. This step is known to 


s: A — Z with the same line sums as f. We follow the procedure used in 
the second part of the proof of Theorem 1 and start with the minimal corner 
i* of M(g;sj. With an appropriate rational coefficient ro with |ro| < 1/2, the 
value (q + roM(g;s))(2*) will be an integer. We now continue in increasing 
lexicographical order in i and choose coefficients r; subject to |r;| < 1/2 such 
that the value of (q + $5 ri M(y;s))(i) is an integer. (Here < under the >> 
<i 
refers to the lexicographical ordering.) Observe that the values at i’ (i’ < i) are 
not changed in the ith step. After executing this procedure for the whole set C 
of the minimal corners of the switching atoms, we obtain a function s having 
integer values on C. By a similar process (taking the switching atoms one 
by one, in increasing lexicographical order) we get that there exist integers ty 
(u € U) such that the values of f+ >) t, M(y;s) and s coincide on C. As these 
ucU 
functions have the same line sums corresponding to S, applying Corollary 1 
with R — Q, we conclude that they are equal; hence s takes integer values 
on the whole set A. Clearly, this construction of s needs only a polynomial 


ae n 
Consider now all the functions as vectors ( [| m;-tuples), and solve over 
j=l 
Q the following system of linear equations: 


(s, Mo;s)) = 3 C (Mus); Mu;s)) (4.34) 

ucU 
in c, where (.,.) denotes the inner product of vectors and v runs through 
the elements of U. As the switching atoms are linearly independent according 
to Theorem 1, this system of equations has a unique solution. This can be 


> lc;lMqu;s), where ||a|| denotes the nearest integer to a. Observe that 
ucU ` S 
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s— >) & Mus) is just the projection of f (but also of q and s) onto the 
ucU 7 
orthogonal complement of the linear subspace generated by the switching 


atoms. This implies 
lel € I£| - | ^ (cs. Wel DMqas) | - (4.35) 
ucU 


There are at most 27 switching atoms that contribute to the value of any fixed 
point, each with a contribution at most 1/2 in absolute value in the above 


T 
equation. Thus, we may conclude |g| < |f| + 277! , / [T m;. 
j=l 
Finally, notice that all the steps of the above algorithm are polynomial in 


max (m;). Thus, the proof of Theorem 2 is complete. 
jl n 














Remark 4. We mention that if we know that Problem 1 admits a solution, 


tk 
i.e., f has {0,1} values in the above theorem, then |f| = 2 Cr (for any 
\ i=1 


k = 1,...,d), whence we get |g| < (277! +1),/ [[ mj. Moreover, as noted 
j=l 

in the proof of Theorem 2, we can replace |f| with |g| (or with |s|) in the 

upper bound (4.33). Therefore, an upper bound for |g| can be given that only 

depends on the line sums and the directions. 


4.3.4 Dependencies Among the Line Sums 


Obviously, the sum of all row sums of a function f : A — Z coincides with 
the sum of all column sums of f. In this subsection we give a simple formula 
for the number of dependencies among the line sums corresponding to S. 
Let A, S, and Fs(z) be as above, and write N; for the degree of Fs in 
x; (j = L,...,n). Then by Theorem 1, the switching atoms form a basis 


of a module of dimension [[(m; — N;) over Z. Suppose that Ls denotes 
j=l 


the number of line sums for A corresponding to the directions in S, and let 
Dg denote the number of dependencies among these line sums. Then as the 


n 
number of unknowns is [[ m;, elementary linear algebra tells us that 


g=1 
Ds = Ls + [| (mj; - Nj) - [[m;. (4.36) 
j=l j=l 
In particular, if n = 2, then there are apm + |bk|Mmı — az|bg| line sums 


belonging to a direction (ap, bk) € S. Hence in this case, as aj > 0 we have 
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d d d 
Ds — ma ar + mi M |br|— X. axlde| (4.37) 
k=1 k=1 k=1 
d d d d d 
+ (r — Y) (v — 5 ex) — mim» = So ax 5 lbp] — 3 ax |x| ; 
k=1 = = 


k=1 1 k=1 k=1 
(4.38) 


4.4 Emission Tomography with Absorption 


In this section we generalize the results from [14], which were presented for 
dimension 2, to the case of general dimension. 

To model the physical background of emission tomography with absorp- 
tion, consider a ray (such as light or X-ray) transmitting through homogeneous 
material. Let Jo and J denote the initial and the detected intensities of the 
ray. Then 

I-—lo:e **, (4.39) 


where u > 0 denotes the absorption coefficient of the material, and x is the 
length of the path of the ray in the material. We put 6 = e", and we call 8 
the exponential absorption coefficient. We mention that as u > 0, we have 
B > 1. Note that by the absorption we have to work with directed line sums 
that do depend not only on the line, but also on the direction of the radiation 
through that line. 

We further assume that g represents (radioactive) material that is emitting 
radiation. If g(z) = 1, then there is a unit of radiating material at i; otherwise, 
g(i) = 0 and there is no such material at i. 

As we have absorption, we attach some absorption coefficient to each di- 
rection. Hence we slightly adjust our previous notation. Let d be a positive 
integer, and let S = {(a,, 6k) | k = 1,...,d} be a set, where a, € Z” with 
gcd(aji,..., aka) = 1 for k = 1,...,d, and for the real numbers fy we have 
By 2 1. For k = 1,...,d, put By = {b € A|b+a, € A}, and for any i € A let 
SG,k) denote the integer for which i = b — (suk) — 1)a, with some b € By. By 
the directed absorption line sum of g along the line T = b— tap (b € By,t € Z), 


we mean 
X gto. e. (4.40) 
ieTnA 





(Here there is a hidden assumption on the shape of the absorbing material, 
but this is irrelevant for the switching configurations.) In Fig. 4.1 in the In- 
troduction, we illustrated how directed absorption line sums are interpreted. 

Let i, © i for i, is € A and k=1,...,d if, and only if, i, — i; = ta, for 
some t € Z, and write H t bbs Hi ) for the equivalence classes of m Taking 
arbitrary real numbers cy; (k = 1,...,d; 1=1,...,t~), Eq.(4.7) is just given 
by 
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wee Maer (eed oa TEL (4.41) 


ieg(? 


Thus, in this case Problem 1 is the standard problem in emission tomography 
with absorption. (See also the DA2D(8) reconstruction problem in [20] for the 
two-dimensional case.) 

If the absorption is independent of the direction, then 3, = e"l2!, since |a| 
is the distance between consecutive lattice points on the line b — ta. However, 
we prefer to leave the possibility open that the absorption coefficient depends 
on the direction in which the medium is passed. Our definition of sq; p) makes 
it possible to distinguish between two opposite directions. Thus, b — ta and 
b — t(—a) represent the same line, but opposite directions. 

Finally, we mention that in case when 6p = 1 (k = 1,...,d), the problem 
reduces to the classical problem of discrete tomography. 


4.4.1 The Structure of the Switching Configurations 


In this section we give a full description of the set of solutions g: A > Z 
to (4.41). First we consider the case when cj; = 0 for all k = 1,...,d and 
l = lL...,ty, which is when all the directed absorption line sums of g are 
zero. For this purpose we need some further notation. 

First we note that if any of the 8;-s are transcendental, then f is uniquely 
determined by its directed absorption line sums in the corresponding direction 
ap. Hence from this point on, we assume that all the exponential absorption 
coefficients are algebraic. 

Let a € Z^ be a direction (i.e., gcd(a1,...,a4) = 1). Let 8 be a nonzero 
algebraic number of degree r, and let Pa(z) be the defining polynomial of 8 
having coprime integral coefficients. Put 


fas) = Pass) [] e; . (4.42) 


aj «0 


Hence f(5,5)(x) € Z[z]. 

In the proof we shall make use of a fundamental correspondence between 
functions g : A — Z and polynomials in n variables. Namely, to such a function 
g, we attach the polynomial 


OSS: Oe (4.43) 


ic A 


Then into direction a the line sums of g are the coefficients of x,(z) “modulo” 
f(a,8). The polynomials are pairwise coprime except for some well-described 
special cases, when they are conjugate. Therefore, the polynomial Fs defined 
below represents the least common multiple of the polynomials f(4,.5,). Let 
S = {(a,, B) | k = 1,..., d) bea set, where for each k, a, is a direction and 
Gx is a real algebraic number with 8; > 1 of degree rz. Two elements (ap, Gx) 
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and (a, 8.) of S are equivalent if a, = a, and 6; and ĝe are algebraically 
conjugated elements, or a, = —a, and fy aud 1/8. are algebraically conju- 
gated elements. Let S* be a subset of S containing exactly one element of S 
from each class of this equivalence relation. Put 


Fs(z- ][] fe. (4.44) 


(a), Ak )ES* 


We say that S is valid for A if Nj := deg, (Fs(x )) «m; (J = 1; 0): 
Put U = {u € Z” | 0 < uj < mj- N; Gj = 1,. n)}. For u € U set 
F(u;s)(z) = z*Fs(zx), and define the functions M(y;s) A > Z by 


Mus) (i) = coeff(z*) in F(s)(z) forie A. (4.45) 


The functions M(,;s) are called the switching atoms corresponding to the set 
S. By the minimal corner of the switching atom M(g;s), we mean the element 
i* that is lexicographically the first element of A for which the function value 
of M(o,s) is nonzero. The minimal corner of M(,;sj is i“ + u. 

Our main result in this section shows that switching configurations can be 
obtained as combinations of shifts of the switching atom M(g;sj also in the 
case of emission tomography. 


Theorem 3. Let A, S, and M(u;s) be as above, with the assumption that S is 
valid for A. Then any function g: A — Z with zero directed absorption line 
sums corresponding to the pairs (aj, 6k) of S can be uniquely written in the 
form 


g — M cuM(us) (4.46) 


ucU 


with c, € Z (u € U). Moreover, every such function g has zero directed 
absorption line sums corresponding to the elements of S. 


Remark 5. Note that if S is not valid for A, then there is no nontrivial f 
having zero directed absorption line sums in the directions given by S. This 
fact simply follows from the proof of Theorem 3. 


As an illustration, we give two examples (partly from [14]). 


Example 4. First we consider a similar situation as Kuba and Nivat do in 
[20]; however, in Z?. Let S = (((—1,0,0), 8), ((0, 1, 0), 8), ((0,0, 1), 8)), where 
B = (1+ V5)/2. Then we have Pa(z) = 2? — z — 1, 


f(1,0,0,8) (21,22, 23) = —21 — 21 +1, f(0,3,0),8) (21,22, 23) = 2$ — £2 — 1, 
(4.47) 

and 
f((0,0,1),8) (21, £2, £3) = 12 adj — 1. (4.48) 


Thus, we obtain 
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Fs(z1,22,23) = (x22 —22x2—27-- 3112 —2123—21—22-- 134-1) (13-23 — 22), 

(4.49) 
and N; = Nə = N3 = 2. So if A is of type m; x m2 x ma with m1, M2, M3 > 3, 
then S is a valid set for A. Now M(g,s) is given by 


0 0 0 0...0 0 0 0 0...0 0 0 0 0...0 
0 0 0 0...0 0 0 0 0...0 0 0 0 0...0 
sa. 1 1 0...0 -1 1 1 0...0 1—1—1 0...0 
labi 0...0 1-1-1 0...0 -1 1 1 0...0 
[clc 0...0 pere 0...0 ep 1 1 0...0 

(4.50) 


where these tables represent the values of M(o;s) on the “slices” corresponding 
to the coefficients of 1,273,723 in Fs, respectively. (All the other values are 
zero.) The switching atoms M(,;s) (u € U) form a basis of the set of functions 
g: A — Z having zero line sums corresponding to the three elements of S. 


Example 5. Now we consider an example for n = 2 where both opposite di- 
rections and different exponential absorption coefficients occur. Let 


S= Neb 0), B), ((1,0), 8), ((0, =e), ((0, 1),6)}, (4.51) 

with 8 = (1+ J/5)/2, y = 24 v2, and ô = y/2. We obtain Pg(z) = z? — z — 1, 
P,(z) = z? — Az + 2, and P5(z) = 2z? — Az + 1. We have 

f(-1.0),9)(@1,42) = —91 - 31-1, fi, (21,22) = 31 21 — 1, (4.52) 


and 
f((0,—1),) (£1; 22) = F((0,1),5) (41, z2) = 223 — 43 +1, (4.53) 


as y and 1/ó are associated elements. We get 





Fs(z1,22) = —2x122--Av (12 —112-61222 —122225--322 —222--Az3—1, (4.54) 


and Nj, = 4, No = 2. So if A is of type mı x ma with mı > 5 and m» > 3, 
then S is a valid set for A. Now M(o;s; is given by 


0 0 0 0 0 0 TE 0 

0 0 0 0 0 0 E. 0 (4.55) 
—2 0 6 0 —2 0 E: 0 

4 0 —12 0 4 0 ae 0 
—1 0 3 0 —1 0 Vaf 0 


and the switching atoms M(u;s) (u € U) form a basis of the set of functions 
g: A—Z having zero line sums corresponding to the four elements of S. 
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To prove Theorem 3, we need several lemmas. To keep this exposition 
self-contained, we include their proofs. Lemma 2 shows the correspondence 
between zero line sums and division by polynomials. Note that line sums of 
functions A — L are defined in the obvious way. 


Lemma 2. Let A be as before, a a direction, and B a nonzero algebraic num- 
ber. Let L be some field containing the splitting field of Pa(z). Put 


fap (2) Br T g”. (4.56) 


aj «0 


Then a function g: A — L has zero line sums corresponding to the pair (a, P) 
if, and only if, fray (x) divides xg(x) in L[a]. 


Proof. We prove the lemma only with a; > 0 (j = 1,...,n), as the other cases 
can be treated similarly. 

Put B = {b € A|b-- a ¢ A} and let Ij be the number of the points of A 
on the line b — ta (b € B,t € Z). Observe that we may write 


Iy—1 Iy—1 
= E (b — sa)x os x? 2 X` g(b — sa) ^. (4.57) 
bEB s=0 bEB  s=0 


If :2— 8 divides x, (x) in L[z], then after substituting zı — 61% T] yu 
j=2 
Ip—1 


the polynomial xg(x) becomes identically zero. This shows that 57 g(b— 
s=0 


sa)9-? vanishes for every b € B; hence g has zero absorption line sums cor- 
responding to (a, 8). This proves the “if” part of the statement. 
To prove the “only if” part, suppose that all the line sums 


Ia—1 1-1 
S gk- sa) = B® 3 g(b— (I —s—1)96* (be B) (4.58) 
s=0 s=0 


of g corresponding to (a, 8) vanish. This means that Ó is a root of the poly- 
Iy-1 
nomial Q(z) :— M g(b — (Ip — s — 1)a)z* for each b € B. Thus, for every 


b € B, the bolyoridi Qp(x*) is divisible by z — 8 over L. Hence x* — 8 
divides x(x) = > z5*C—794Q,(z*) in L[z], and the lemma follows. 
beB 














Lemma 3. Using the notation of Lemma 2, write r for the degree and go 
(1 € c € r) for the conjugates of B. Then the polynomials fia go (£) (1 € 
c € r) defined in Lemma 2 are pairwise nonassociated irreducible elements in 


Liz]. 
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Proof. As gced(a1,...,@n) = 1, the irreducibility of these polynomials is a 
simple consequence of Corollary 2 of [24] p. 103. The statement that the 
polynomials are pairwise nonassociated is trivial. 














aj 


Corollary 2. The polynomials Pa(x£) |] dx 
aj «0 


are irreducible in Zz]. 


Proof. We prove the statement only for a; > 0 (j = 1,...,n); the other cases 
are similar. 

Let (9 (1 € c € r) be the conjugates of 3, and let L be the splitting field 
of Pg over Q. Then, in view of 


Pa(2*) = co | [(z& — 8), (4.59) 


where co is the leading coefficient of Ps, the statement immediately follows 
from Lemma 3. 














In the next lemma we show that the divisibility property of Xg over L in 
Lemma 2 implies a stronger property over Z. 


Lemma 4. Let a and B be as in Lemma 2. Using the previous notation, a 
function g : A — Z has zero line sums corresponding to the pair (a, B) if, and 
only if, Pe(x*) JI a divides x(x) in Z[z]. 

aj «0 


Proof. The “if” part of the statement easily follows from Lemma 2. We prove 
the “only if” part only for a; > 0 (j = 1,...,n); the other cases can be handled 
similarly. In this case observe that by Lemma 2, x%— p divides x4(x) over any 
field L that contains the splitting field of Pg(z). However, by conjugation, for 
every conjugate 3 of 8, ze — 9(? also divides x,(x) in L[r]. By Lemma 3, 
this assertion immediately implies the statement. 














Lemma 5. Let a, a* be directions, and 3,3" be nonzero algebraic numbers of 
degrees r and r*, respectively. Then the polynomials Pg(x*) [| p and 
aj «0 
* —r*at ; A ; : 3 * 
Pox (x2 LIT vx, ^ are associated in Z[x] if, and only if, either a = a* and 
j 
B and B* are conjugated, or a = —a* and B and 1/8* are conjugated. 


Proof. The “if” part of the statement is trivial. Suppose that Pa(x£) [] gp a 
a5 <0 


ok ok 


and P3. (a) IL 2; “7 are associated. Then the degrees of 6 and 3* must 
be equal, i.e., r = r*. For 1 € c € r, let 8 and Br) be the conjugates of 
DB and 8*, respectively. Let L be any field that contains the splitting fields of 
both Pa and Po». Then we have the factorizations 
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Pa(a*) Il a = Il faa (2) (4.60) 
c=1 


aj<0 
and 


Po» (z2) Il Tj TAS IL fe soa) (4.61) 
a5 <0 c=1 
in L|z], where the polynomials on the right-hand sides are defined in Lemma 
2. By our assumption and Lemma 3, we obtain that for each cı with 1 < 
ci < r, there exists a cg also with 1 < c2 X r, such that f(,5)j(x) and 
m gx(e2))(£) are associated elements in L[r]. By comparing the exponents 
of x; (j = 1,...,n) in these polynomials, we get that a = +a“ holds, and 
for the corresponding pairs (c4, c3), 3°) = Br) or Bier) g* ^9 = 1 is valid, 
respectively. This yields that (80 | 1 < e <r} = (8*9 | 1€ e € r) or 
{8 |1«ex r} 2 (1/0*? | 1 & e € r), respectively, which establishes the 
“only if” part of the statement. The proof of the lemma is now complete. 

















Proof (of Theorem 3). By definition, for every u € U, the function F(,;sj is 
divisible by f(a,,a,) for any k with 1 < k < d. Hence by Lemma 2, M(u,s) 
has zero line sums corresponding to the pairs in S. This proves the second 
statement of the theorem. 

Let 


H={f: A— Z | f has zero absorption line sums for the elements of S} . 
(4.62) 
We first prove that the switching atoms M(y;s) (u € U) generate H. Combin- 
ing Corollary 2 and Lemmas 4 and 5, for any g € H, we obtain 


Fs(z) | xg(z) in Z[z] . (4.63) 


Hence there exists a polynomial Q(z) = »; cuz" with c, € Z (u € U) such 
ucU 
that Q(z)Fs(z) = xq(z). We rewrite this equation as 


Xg(x) = CuF (u;s)(2) - (4.64) 
ucU 


Now by the definitions of xg(x) and the switching atoms M(y;s), we immedi- 
ately obtain 


g= M csM(us) 5 (4.65) 
ucU 


which proves that the functions M(,;s; generate H. 
Suppose now that for some coefficients lu € Z, we have 


M luMusy (i) -0forieA. (4.66) 
ucU 
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By the definitions of the switching atoms, at the minimal corner of M(g,s) all 
the other switching atoms vanish. This immediately implies lg = 0. Consid- 
ering now M(y.s) with u € U in increasing lexicographical order, we conclude 
that all the coefficients lu are zero in (4.66). This shows that the switching 
atoms are linearly independent, which completes the proof of the theorem. 














Remark 6. Similarly as in case of the classical problem of discrete tomography 
in Section 4.3, it would be possible to provide an algorithm that produces a 
“small” integral solution to (4.7) in case of emission tomography. We omit the 
details. 


4.5 Tomography on Curves 


In this section we illustrate that our method is rather flexible in the sense 
that variations to other sums than line sums are possible. In this more gen- 
eral case, there do not exist translation-invariant switching atoms. However, 
our polynomial method allows us to construct nontrivial configurations with 
vanishing sums and characterize such configurations in Theorems 4 and 5. 
We shall illustrate the method in two dimensions by examples where sums 
are taken over sets of the shape Hk = {(i,7) € A | akj = bkG(i) + t}, 
where G: Z — Z,t € Z and the (aj,b;) are distinct pairs of coprime 
integers for k — 1,...,d. The basic idea is that to the given function g : 
A — Z we adjoin the “generating” polynomial X g(i,j)rC ?y) (instead 
(453)€A 
of Y; g(i,j)x'y?). Since akj = b G(i) + t, the exponent pairs (G(i), j) for 
(53)eA 
(i, j) € Hy are on the lines apy = bgg + t. So the sums over Hy, turn into line 
sums, and we can apply the preceding theory. Doing so, we find switching 
atoms. The problem is to return to the original situation, where there is 
no linear structure. However, by constructing polynomials with exponents 
of prescribed form that are multiples of the switching atom polynomial, we 
are able to construct configurations with vanishing sums for all given Hj. We 
give two examples. 


Example 6 (Broken line sums). We consider the situation where light (or X- 
ray) entering from the left along the half-line ay = ba + t (a € 0) is broken 
when reaching the y-axis and continues along the half-line ay = cbx--t (x > 0), 
where c is a given integer. 

'To describe this case, we need to slightly adjust our previous settings. Let 
m1,m» be positive integers and n4 a negative integer. Put 


A-((i,j))eZ |n <i<m,0<j <m}, (4.67) 


and let aj, by (k = 1,...,d) and c be nonzero integers with gcd(ax, by) = 1 
and ap > 0 (k = 1,...,d). Set 
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Ta = { (i,j) € Z? |i € 0, apj = byi + t}U{(G,7) € Z? | à 9 0, akj = cbki + t} 


(4.68) 
for k = 1,...,d and t € Z. Let (i, j1) È (iz, j2) for (i1, j1), (i2, j2) € A and 
k = 1,...,d if, and only if, these points belong to the same set Tkt for some 
MEM t. Write Hj i ), HË ) for the equivalence classes of Š on A. These 


classes are in fact a. interecetionis of the broken lines Tj, with A. By the 
broken line sums corresponding to (ax, bk) of a given function g : A — Z, we 
mean the expressions 


Ck] = 5 g(t,j) fork=1,...,d; l=1,...,th. (4.69) 
(i3) e Hj? 


Note that (4.69) is a special case of Eq.(4.7), with unit weights oy = 1 (k = 
1,...,d). 

With the above modifications we can apply our machinery to the broken 
line case as well. First we introduce some further notation. 


d 
Let S = {(ax, bx) }¢_, with (aj, bpk) as above, and write Nı = Y; aj and 
k=1 


d 
me: |b.|. We say that S is valid for A if Ny < mı — nı and Ng < mo. 


For k= 1 o, d, put 


xry —1, if bp > 0, 
flow =| i m (4.70) 


T 


g^ — yk, ifby <0, 


and set Fs(z,y) = n fia, y). 


In view of the bike lines, we define 


0 m»o3—1 mi—1m»o3—1 
xs(m y) 227 | XO M gi jey + 3 gli, jay? | — (4.71) 
ini j—0 i=l  j—0 


as the “generating” polynomial of g : A — Z. Note that the factor z "' is 
introduced only to keep the exposition inside Z[z, y]. 
For the solutions of (4.69), we have the following: 


Theorem 4. Let A and S be as above, with the assumption that S is valid 
for A. Then a function g: A — Z has zero broken line sums corresponding 
to S if, and only if, xg(z, y) is divisible by Fs(x,y) in Z[v, y]. 


Proof. Let g: A — Z bean arbitrary function and let (a, b) € S. For simplicity 
we assume that b > 0; the case where b « 0 is similar. Observe that we can 
write 
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0 mi—1 
wana=o" 7| $7 wanes 9 dX sao 
tcZ i—ni aj=bi+t i—l aj=cbi+t 
0xj«ma O0<j<me2 
0 mı—1 
—z Y y^ Y^ > ga TrA M, GN 
bcZz i—ni aj=bi+t i—l aj=cbi+t 
0xj«m»a 0xj«m»a 


(4.72) 
Now just as previously (see, e.g., the proof of Theorem 1), we obtain that g 
has zero broken line sums corresponding to (a,b) € S if, and only if, z*y^ — 
1 divides x,(z,y) in Z[x, y]. Observing that the polynomials f(x,y) (k = 
1,...,d) are pairwise coprime (in fact, prime) elements of Z[x, y], the theorem 
follows. 














We illustrate the above theory by the example when S = ((1,1), (3, 1)) 
and c — 2. In this case the broken line sums are calculated in accordance with 
Fig. 4.2. Moreover, we have 


Fe(z,y) = (zy —1)(0?y — 1) = a^? — ay — ty 1. (4.73) 


Theorem 4 shows that g : A — Z has zero broken line sums corresponding to 
S if, and only if, Fs divides Xg over Z. Hence to present a nontrivial example, 
we should find a nonzero multiple of Fs in which all the exponents of x greater 
than some nonnegative integer are even. For switching configurations entirely 
contained in ((z, y) | x < 0} or in ((z, y) | x > 0), the theory of Section 4.3 
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Fig. 4.2. Broken lines corresponding to S = {(1, 1), (3, 1)} and c = 2. 
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applies. Suppose we want a switching configuration with “minimal corner” at 
(—3,0). Then all exponents of x in yg greater than 3 should be odd. We have 


(zy-- 1) Fs(z, y) = 2*y? -a?y-a^ y +1 = z? (ay) -y- y m 7). (4.74) 


Hence if ny € —3, mı > 3, and m5 > 4, then the function g: A> Z 
represented by 
0...0 0 0 0 0 0 0...0 


(4.75) 


Quod 6S 
ooooo 
^Doooo.- 
ooooo.. 

| 
ooroo 

SOrFRCQCcCSO.::: 
oocooo.. 
ooooo 
ocooooo 


has zero broken line sums along the corresponding broken lines. Here f indi- 
cates the y-axis. 


Example 7 (Parabola sums). We consider the situation when the X-rays (or 
light) pass along parabolas ay = bx? + t (x > 0). 

Let A be as before, and let aj, bj; be coprime nonzero integers with a; > 0 
(k =1,...,d). Let (i, j1) È (iz, j2) for (i, j1), (12,52) € A and k=1,...,d 
if, and only if, b (i? — i2) = ak(j1 — j2) (i.e., for some integer tk, we have 
bki? = ajji — ty and bki? = aij» — ti, that is, these points lie on the same 
vertical translate of the graph of the function apy = bjz?). Further, write 
HP z Hi” for the equivalence classes of Š on A. Let a function g: A>Z 
be given. By the parabola sums of g corresponding to (ax, bp), we mean the 
expressions 


Ck] = 5 gli, j) fork=1,...,d; L=1,...,tk. (4.76) 
(4,3)e HP 


Obviously, (4.76) is a special case of Eq.(4.7) with oy = 1 (k = 1,...,d). 

As it will turn out, with the modifications indicated above, we can apply 
our previous results to this case. We need, however, some notation. Let S, Nj, 
No, f(x, y), and Fs(z, y) be defined as in case of broken lines. 


We choose 
x(t) = Y. s any! (4.77) 
(4,j)EA 
as the “generating” polynomial of g: A — Z. 
For the solutions of (4.76), we have the following 


Theorem 5. Let A and S be as above, with the assumption that S is valid 
for A. Then a function g: A — Z has zero parabola sums corresponding to 
S if, and only if, xg(x, y) is divisible by Fs(x,y) in Z[z, y]. 
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Proof. Let g: A — Z be an arbitrary function and let (a, b) € S. For simplicity 
we assume that b > 0; the case where b « 0 is similar. Observe that we can 
write 


x«(my)— 35 M. gli, d ah y —- V y/^ V^ gli, j)(wy’/2)* . (4.78) 


t€Z aj=bi2+t teZ aj=bi?+4+t 
(45,3)€A (4,3)€A 


Now similarly as, e.g., in the proof of Theorem 1, we can easily verify that g 
has zero parabola sums corresponding to (a,b) € S if, and only if, x%y? — 1 
divides x,(x,y) in Z[r, y]. As the polynomials f(x,y) (k = 1,...,d) are 
pairwise coprime elements of Z[, y], the theorem follows. 














We illustrate the example by analyzing two particular cases. We start with 
S = ((1,1), (1,2)}, i.e., the parabolas are given by y = z?--t and y = 22?--t5, 
respectively. In this case we have 


Fs(z,y) = (zy — 1)(ay? — 1) = 3^y? — zy? — zy 1. (4.79) 


Theorem 5 shows that g: A — Z has zero parabola sums corresponding to 
S if, and only if, Fs divides x, over Z. The problem, however, is to find some 
nonzero multiple of Fs such that all the exponents of x are squares. Suppose 
we want a switching configuration with “minimal corner" at the origin. We 
can readily verify that 











(z^y^ E ay? try?’ +y’ Ey 1) Fs (a, y) = cy’ ryt- m -ry y y^ y 1. 
(4.80) 

Thus, if mı > 2 and mə > 8, then the function g: A — Z represented by 

0.0 0 0...0 

0.0 0 0...0 

0.0 1 0...0 

0.0 0 0...0 

0.0 0 0...0 (4.81) 

0-1 0 0...0 

0-1 0 0...0 

1-1 0 0...0 

1-1 0 0...0 

100 0...0 

1 


provides a nontrivial configuration having zero parabola sums along the 
parabolas y = z? +t, and y= 22? + t4 for any t1, te € Z. 

Finally, we consider S = ((1,1), (1,2), (1,3)}, i.e., we have three parabolas 
given by y = £? + t4, y = 2x7 + te, and y = 3x? + ts, respectively. Now we 
have 
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Fs(z,y) = (xy —1)(ay? — D)(zy? — 1) 


= gyf —a?.49 — a? y4 — oy + ry? tay? +ay—1. (4.82) 





By Theorem 5 we know that g: A — Z has zero parabola sums corresponding 
to S if, and only if, Fs divides x, over Z. The problem is again to find some 
nonzero multiple of Fs in which all the exponents of x are squares. We can 
easily check that the polynomial 

(YE +y” + 2994 + y23 + y22)a9 — (y + y + 2y! 218 + 317 + 316 

P Ay) sx 4yH^ EA 4yP 3yP? 3y! | 2410 ap 24? ze y? ES yet gj (yP 3 ay 
4y? + Gy"? + 8y't + 9y? + 10y? + 10y* + 10y* + 99 + 8y? + 6y* + A? 
2y? + y)a — (y? + 2y!! + Ay? + 5y9 + 7y8 + 7y* + 8y + Ty? + Tyt 

+ By? + Ay? + 2y + 1) (4.83) 



































is a multiple of Fs in Z[r, y]. Hence we obtain a nontrivial g: A — Z having 
zero parabola sums along the three parabolas by replacing x°? with z? and z* 
with z? and making the corresponding table. 
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Uniqueness and Additivity for n-Dimensional 
Binary Matrices with Respect to Their 
1-Marginals 


E. Vallejo 


Summary. In this chapter we deal with the question of when an n-dimensional 
binary matrix is uniquely determined by its 1-marginals and with the related notion 
of (0, 1)-additivity. We present a survey of known results; several of them have been 
considered before only in dimensions 2 and 3. Here, we show how to extend them 
to any dimension. The main results are characterizations of uniqueness and (0, 1)- 
additivity: one, of algebraic nature, involves matrices with integer entries; the other, 
of geometric nature, uses transportation polytopes and permutohedra. 


5.1 Introduction 


We consider the following problems from discrete tomography: When is an 
n-dimensional binary matrix X uniquely determined by its 1-marginals (also 
called Radon transforms, hyperplane sums, or (n — 1)-dimensional X-rays)? 
When is X (0,1)-additive? When is there a binary matrix with prescribed 
l-marginals? We present a survey of known results; several of them have 
been considered before only in the two and three-dimensional cases. Here we 
show how to extend them to any dimension. This chapter is an expanded 
version of [33]. Uniqueness and additivity were treated, for arbitrary n, in a 
slightly different but equivalent language in [9], and extended to a much more 
general setting in [10]. Their computational complexity has been considered 
in [6, 12, 23]. The problem of checking if an n-dimensional binary matrix X 
(n 2 3) is a matrix of uniqueness is NP-complete [12, Theorem 2.7], while 
checking if X is (0, 1)-additive (at least when X is a pyramid) can be decided 
in polynomial time [23, Theorem 7.1]. Since (0, 1)-additivity implies unique- 
ness, determining (0,1)-additivity of a binary matrix is a computationally 
simpler approach to uniqueness in some cases. For n — 2, these problems 
have been studied for a long time and are well understood. They coincide 
with the corresponding problems for one-dimensional X-rays or line sums; 
see [4, 18] for an overview. For n — 3, a different approach was followed 
in [5, 23, 28, 29, 31, 32]. First, one notes that uniqueness and (0, 1)-additivity 
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are invariant under permutation of slices, so we can assume without loss of 
generality that the 1-marginals of a binary matrix are weakly decreasing. Un- 
der this assumption the matrices X uniquely determined by its 1-marginals 
are pyramids [28]; these can be identified, in a natural way, with matrices A 
whose entries are nonnegative integers. Thus, uniqueness and (0, 1)-additivity 
can be translated to new notions about matrices with integer entries [28, 32]. 
Moreover, by extending the newly found properties to matrices with nonnega- 
tive real coefficients, we obtain geometric characterizations for uniqueness and 
(0, 1)-additivity of a pyramid .X by looking at the intersection of the permu- 
tohedron determined by its corresponding matrix A with the transportation 
polytope in which A lies [23]. The most important notions appearing from 
this approach are minimality for a matrix A with integer entries and real- 
minimality for a matrix A with real entries. The notion of minimality has 
applications to the representation theory of the symmetric group [1, 30]; real- 
minimality ends up being equivalent to additivity [23]. Both minimal and 
real-minimal matrices also emerge as optimal solutions of certain quadratic 
programming problems [23]. It would be interesting to see if our approach can 
be extended to other similar instances in discrete tomography (DT), such as 
line sums along coordinate axes. 

Our motivation for dealing with notions for integer or real matrices, such 
as additivity, minimality, and real-minimality, is not to study similar con- 
cepts to uniqueness and (0, 1)-additivity on other matrix classes generalizing 
binary matrices. On the contrary, 7-uniqueness, minimality, additivity, and 
real-minimality help us to understand the original notions from D'T, namely 
uniqueness and (0, 1)-additivity, from a combinatorial (integer matrices) and 
a geometric (polytopes) point of view. In both instances, majorization plays a 
substantial role. The combinatorial approach provides a more economic way 
of representing a pyramid, namely through an integer matrix, which permits 
alternative descriptions of uniqueness and (0, 1)-additivity. In particular, test- 
ing additivity is simpler than testing (0, 1)-additivity. We also believe that this 
approach will lead to new results in the future. The geometric approach, on 
the other hand, yields characterizations of uniqueness and (0, 1)-additivity for 
binary matrices that make more transparent the relation between these two 
notions and give a new proof that (0, 1)-additivity implies uniqueness. An- 
other consequence is that an algorithm from [21] can be applied to obtain a 
minimal matrix with a given set of 1-marginals in polynomial time. 

The material is organized as follows. In Section 5.2 we recall the definitions 
of majorization, dominance order, and the permutohedron and review some 
known results. They will be essential in some characterizations of existence, 
uniqueness, and additivity presented here. Section 5.3 deals with the notions of 
uniqueness and (0, 1)-additivity of n-dimensional binary matrices. Section 5.4 
contains a survey of results about uniqueness, existence, and (0, 1)-additivity 
in dimension 2. In Section 5.5 we introduce the notion of minimal matrix and 
use it to characterize uniqueness. We also explain how (n — 1)-dimensional 
matrices with nonnegative integer entries and n-dimensional binary matrices 
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are related. This relation is also used in Section 5.6, where (0, 1)-additivity 
for binary matrices is translated to a new notion, called additivity, for matri- 
ces with nonnegative integer entries. In Section 5.7 we include several results 
about three-dimensional binary matrices and two-dimensional integer matri- 
ces. In particular, we present a new, shorter proof that any plane partition 
of size 2 x q is additive. In Section 5.8 we extend the notions introduced in 
previous sections for matrices with integer entries to matrices with real entries 
and describe these new notions geometrically. We also explain how minimal 
and real-minimal matrices appear in quadratic programming. The main tool 
in this section is the beautiful geometric description of majorization due to 
Rado, which is presented in Section 5.2. We apply this approach in Section 5.9 
to show the equivalence between additivity and real-minimality. 


5.2 Majorization and the Permutohedron 


This section contains some standard results about majorization, dominance 
order, and permutohedra that will be needed in this chapter. We include 
Rado’s original proof of his beautiful geometric description of majorization 
via permutohedra [24]; see also [20, p. 113]. 

For a vector a = (a1,...,@m) € R”, we denote by z(a) = (aj,...,a*,) the 
vector formed by the coordinates of a arranged in weakly decreasing order, 
that is, aj > --- > a*,. The size of a is defined by |a| = 57)", a;. We say that 
a is majorized by b = (b1,...,5,,), and denote it by a x b, if 


k k 
la| = |b], and M ^a? < X` bj, for alll <k<m. (5.1) 


i=l i=l 


If a x b and z(a) Z m(b), then we write a < b; see [15, 20]. 
Let a € R” and let o be a permutation in the symmetric group Sm. Denote 
by a, the vector (a5(1),..., 5(&)). Then the permutohedron determined by a 
is the convex hull of the set of all vectors obtained by permuting the entries 
of a: 
P(a) := conv{ as | o € Sm}. (5.2) 


It is a convex polytope. If all the coordinates of a are equal, then, P(a) is 
a point; otherwise, it has dimension m — 1. Its face lattice is known, see, for 
example, [2] (or [13, 34] if all coordinates of a are different). Here we only 
need to know the description of its vertices and edges. For this we need some 
notation. Let a € R™ and denote by aj,...,aj the different values of the 
coordinates of a arranged in decreasing order. The permutation in S,, that 
switches s and t and fixes the remaining numbers will be denoted by (st); such 
permutations are usually called transpositions. We say that a transposition 
(st) is adjacent relative to a if there exists 1 < i < l such that as = a? and 


o 


Qt = Qi+1 
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Example 1. Let a = (1,1,1), b = (2,1,0), and c = (3,0,0). Then a x b < c. 
Here P(a) is a point, P(b) is an hexagon, and P(c) is a triangle and one has 
that P(a) C P(b) c P(e); see Fig. 5.1. 





(3,0,0) (2,1,0) (1,2,0) (0,3,0) 


Fig. 5.1. The permutohedra P(a) C P(b) C P(e). 


Theorem 1. Let a € R”. The set of vertices of P(a) is precisely 
{as |o € Sm}. (5.3) 


Ifa is weakly decreasing and has at least two different coordinates, the vertices 
adjacent to acs (o € Sm) in P(a) are of the form aro for some transposition 
T that is adjacent relative to a. 


Theorem 2. For any vector a € R™ 
P(a)={xzeR”|xzx<a}. (5.4) 


Proof. Since P(a) = P(7(a)) and since æ € P(a) if, and only if, m(x) € P(a) 
for all æ € R™, we assume without loss of generality that a and x are weakly 
decreasing. Let x € P(a); then æ is a convex combination of { a5 | o € Sm }, 
that is, there are nonnegative real numbers t,, € € Sm, such that 37,1, = 1 
and æ = $5, toas. Let k € (1,..., m); then 


k k k k 
Nos ON ds ug € ly v. die S. di. (5.5) 
i=1 o i=1 o i=1 i=1 


Therefore, x < a. 

Conversely, let x € R™ be such that x < a. Assuming that x ¢ P(a), we 
obtain a contradiction: Since P(a) is convex, there is a hyperplane separating 
x and P(a), that is, there is some vector u € R™ such that (u, a) > (u, ap) for 
all p € Sm; see, for example, [14, $2.2] ((-,-) denotes the usual inner product 
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in R”). Then 357 Uoto) > Dici Uo(iyai for all c € Sm. Choose o such 
that ug(1) È Uo(2) 2 + È Ue(m)- Then, since æ is weakly decreasing, 


m-1 
(ois = Ua (Er does edu) dono (ero + En) 
i=1 
E 
> ox [sy = NEG iy) Poly To dac) + Uon en) ch + Tolin) 


3 


ll 
Ed 


I 
Me 


Ug(i)To(i) > X usai 
i=1 
=a 
(usq) — Uo(i+1)) (a1 9: + ai) + Ue(ny(ai ++ an), (5.6) 


Il 
d 


Il 
un 





which contradicts x < a. 











Corollary 1. Let a, b c R™. Then 
axb <> P(a) C P(b). (5.7) 


The following technical lemma can be found in [15, p. 63], [20, p. 121], 
or [31, Prop. 2.1]. It follows also from the face structure of P(a). 


Lemma 1. Let a € R”, b € R* be such that b < (ai,,...,ai,), for some 
sequence 1 X iq <+++ < iy € m; let à be obtained from a by substituting i; 
with bj, 1< j € k. Then à <a. 


When restricted to partitions, majorization is also known as the dominance 
order. A vector A € R” is called a partition if its coordinates are nonnegative 
integers and À4 > A9 > --- > Am. If |A| = N, we say that A is a partition of N. 
The Young diagram of A is the set D(A) := ((4,j3) | 1 < j € A; }. The length 
of A, denoted by (A), is the number of its positive coordinates, that is, the 
number of rows of D(A). We will frequently think of A as an element of R”, for 
some m > L(A), by adding some zeros to the right. The conjugate partition to 
Ais A! = (M,..., X); where k > à and Aj := {i | A; 2j )]. In other words, 
A; is the length of the jth column of D(A). From this observation, it follows 
that A" = A. For example, if A = (5,4,2,2, 1,1, 1), then A' = (7,4,2,2,1). 

Given two partitions A, u of N (we assume both have the same number of 
coordinates by adding some zeros), we say that A is dominated by wif A < p. 
The set of all partitions of N is a lattice under the dominance order and has 
the property that conjugation is an antiautomorphism [3], [20, Chap. 5D], 
that is, 

AxpSwsNn. (5.8) 


Let aE R",1z«i« j € m. A T-transform of a is the vector 


Tija = (a1,...,@; — 1,...,a05 - 1,..., G4) ER”, (5.9) 
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which only differs from a in two coordinates. We finish this section with the 
following result due to Muirhead [22]. The proof presented here is constructive 
and closely follows the proof in [20, Chap. 2B]. 


Lemma 2. Let A, p € R” be partitions of N. If X > p, then there exist 
partitions v(0), v(1),...,v(k) € R™ such that v(0) = A, v(i — 1) > v(i), 
v(i) is a T-transform of v(i — 1) for all 1 € i € k, and v(k) = p. 


Proof. Let i = max{t | M > pe}. Since A> p, 35.4 As > Tossa Hs: Note 
that i < m and that j = min{t | i < t and A, < u} is well defined. The 
inequalities À; > li = Hj > Àj, Ài > Aii, and Aj-1 > Àj imply that the T- 
transform T;;A is also a partition of N. Let v(1) = T;;A. Clearly, A > v(1). 
Finally, since Ay = ji; for alli < k < j, one has v(1) > p. Either v(1) = p 
or v(1) > p. In the latter case, we apply the previous method to obtain a T- 
transform v(2) of v(1), such that v(1) > v(2) = u. We repeat this method 
until we reach ys. This is accomplished in k = X` (A+ — pt) steps, where the 
index in the sum runs over all t such that A; > ut. 














Example 2. Let A = (6,4,2,2,2,0,0) and p = (4,3,3,3,1, 1, 1). Then, accord- 
ing to the proof of the previous lemma, we require 4 = 2-- 14-1 steps to go from 
A to u. The intermediate partitions obtained from the proof are v(0) = A, 
v(1) = (6,4,2,2,1,1,0), v(2) = (6,3,3,2,1,1,0), v(3) = (5,3,3,3,1,1,0), 
and v (4) = p. 


5.3 Uniqueness and (0, 1)-Additivity 


In this section we deal with the notions of uniqueness and (0, 1)-additivity 
for n-dimensional (0, 1)-matrices; these were introduced by P.C. Fishburn, 
J.C. Lagarias, J.A. Reeds, and L.A. Shepp [9] in a slightly different but equiv- 
alent language as the one used here. They gave characterizations of unique- 
ness and (0, 1)-additivity (Theorems 3 and 4) and used them to show that 
(0, 1)-additivity is sufficient for uniqueness (Theorem 5) but not necessary 
(Example 5). Later the author observed that, under the assumption that 
the 1-marginals are partitions, matrices of uniqueness have a simpler shape, 
namely, they are pyramids [28, 29, 32] (Theorem 6). This observation leads 
to the results presented in Sections 5.5 and 5.6. We also include a new ele- 
mentary direct proof that (0,1)-additive matrices are, up to permutation of 
slices, pyramids (Theorem 7). 

We fix some notation that will be used throughout this paper. Let N 
denote the set of natural numbers and let No :— NU {0}. For a vector 
q = (qd,...,q,) € N”, we denote q = (q1,.--,4n-1), q = qq, and 
T = %:''Q-1, and for q € N, we denote by [q] the set {1,2,...,q}. 
Also let B(q) := [qm] x -+ x [qn] denote an n-dimensional box and let 
P;: B(q) — [qi] denote the ith projection. An n-dimensional matriz of size 
qı X+: X qn is a map X: B(q) — R. A (0, 1)-matriz or binary matrix is a 
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matrix whose values are zeros and ones. For any matrix X of size q1 X ++- X dn, 
we consider its 1-marginals, also called Radon transforms, hyperplane sums or 
(n — 1)-dimensional X-rays, a^ = (a},...,@4,), i € [n], defined by 


oj: OY, X(k. ije[a]. (5.10) 


kePj !(j) 


Fix N € N. For i € [n], let a^ € NẸ be of size N. Then we de- 
note by F(al,...,a") the set of all n-dimensional matrices with real entries 
having 1-marginals a!,...,a"; by T(a!,...,a"), the set of all matrices in 
F(a',...,a”) with nonnegative entries; by M(al,...,a"), the set of all matri- 
ces in T(al,..., a") with integer entries; and by M*(a!,...,a”), the set of all 
binary matrices in M(al,...,a"). Note that F(a!,..., a") is an affine space, 
T(al,...,a") is a polytope (when n — 2, it is usually called transportation 
polytope; see, for example, [16, 34]) and M(a',...,a”) is the set of lattice 
points in T(al,...,a"). 

Let X € M*(al,...,a?); then X is called a matriz of uniqueness [9] if the 
set M*(al,..., a") has cardinality one; that is, if X is the only binary matrix 
with 1-marginals a!,...,a^; X is called (0, 1)-additive [9] if there are maps 
fi: [a] — R, i € [n], such that for all k € B(q), 


X(k) =1 if, and only if, $7 fi(ki) 20; (5.11) 
i=1 

and X is called a pyramid [29, 32] if for all k, l € B(q) such that ki < l;, 
i € [n], the condition X (l) = 1 implies X (k) = 1. Note that (0, 1)-additive 
matrices are called additive in [9]. We use the term (0, 1)-additive to distin- 
guish it from another notion of additivity introduced in Sections 5.6, resp. 5.8, 
for matrices with integer, resp. real entries. A weakly bad configuration for X 
is a matrix W € F(0,...,0) different from 0 with integer entries, such that if 
W(k) > 0, then X(k) = 1 and if W(k) « 0, then X(k) = 0. A weakly bad 
configuration for X is called a bad configuration if its entries are in (—1,0, 1]. 
'The weight of a weakly bad configuration W is the sum of its positive entries, 
that is, 5, w(k)>o W(K). 


Example 3. Let X be the three-dimensional binary matrix 


1 11 110 1 10 110 
1 1 1 1 0 0 00 0 0 0 0|. (5.12) 
100 100 0 0 0 0 0 0 


Its 1-marginals are a! = (9,4,2), a? = (8,5,2), and a? = (7,4,2,2). This 
matrix is a pyramid. A weakly bad configuration of weight 3 for X is the 
matrix W: 


1 0 0 0 0 1 0 
0 0 0 0 -1 0 0l. (5.13) 
0 0.00 0 0 0 
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Let Y = X — W. Then Y has the same 1-marginals as X. This example 
contains the main idea behind the proof of Theorem 3. Also, note that Y is 
not a pyramid. 


Example 4. Let X: B(q,q,q) — {0,1} be the binary matrix defined by 
X(k) = 1 if, and only if, kı + ko + k3 € q + 2. Then X is a pyramid and 
is (0, 1)-additive [29, Theorem 3.4]. Additivity can be shown as follows. Let 
a = max(i € [q] | 3i € q - 2). Then 0 € q +2 — 3a < 2. Therefore, there is a 
unique integer b such that $(q--2—3a) < b < $(q--3—3a). Let f: [q] — R 
be defined by f(x) := 4a 4-6 — Ax and let f; = f for i = 1,2,3. Then it follows 
from the inequalities satisfied by b that condition (5.11) holds for the maps 
fi, fo, and fa. Therefore, X is (0, 1)-additive. 


Theorem 3. Let X be an n-dimensional binary matriz. Then X is a matrix 
of uniqueness if, and only if, X does not have bad configurations. 


Proof. Suppose first that X has a bad configuration W. Then Y = X - W has 
the same 1-marginals as X, but is different from X; thus X is not a matrix of 
uniqueness. For the converse, suppose that X is not a matrix of uniqueness. 
Then there exists a binary matrix Y # X with the same 1-marginals as X. A 
bad configuration W for X is obtained by letting W — X — Y. 














Theorem 4. Let X be an n-dimensional binary matrix. Then X is (0,1)- 
additive if, and only if, X does not have weakly bad configurations. 


For a proof, see [9]. 
Since bad configurations are weakly bad configurations, Theorems 3 and 4 
imply the following: 


Theorem 5. Every (0, 1)-additive matrix is a matriz of uniqueness. 
For another proof of this theorem, see the end of Section 5.9. 


Example 5. It is shown in Theorem 5 of [9] that the matrix 


11111 111 11111 11111 
11111 111 11111 11100 
11110 110 11000 00000 
11110 100 00000 00000 
11000 100 00000 00000 


is a matrix of uniqueness that is not (0,1)-additive. This is the smallest 
known example of a matrix of uniqueness that is not (0, 1)-additive. For 
some comments about the relation between uniqueness and (0, 1)-additivity, 
see [10, 82.5). A weakly bad configuration of weight 6 is given by the ma- 
trix W: B(5,5,5) — Z that takes value 2 on (4,4, 1), value 1 on (5,1,2), 
(3,2,3), (1,5,4), (2,3,5), value —1 on (5,3,1), (3,5,1), (4,2,2), (4,1,3), 
(2,4, 4), (1,4, 5) and value 0 on the remaining points. The proof of unique- 
ness of this matrix given in [9] shows to what extent proving uniqueness for 
a matrix can be a daunting task and suggests to look for alternative ways of 
approaching uniqueness. This will be done in Sections 5.5 and 5.8. 


(- m n 
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Remark 1. A slice in the ith direction of an n-dimensional (0,1)-matrix X 
of size qı X +++ X qn is an (n — 1)-dimensional (0, 1)-matrix Y obtained by 
restricting X to [qi ]---x{jg}x---x[dn] for some j € [gi]. We can think of X 
as made of q; slices in the ith direction; these slices can be permuted according 
to some permutation c in the symmetric group Sg,. The matrix obtained, 
denoted by c X, has the same size as X. It follows from the definitions that X 
is a matrix of uniqueness, resp. (0, 1)-additive, if, and only if, o.X is a matrix 
of uniqueness, resp. (0,1)-additive. So, when considering the properties of 
uniqueness and (0, 1)-additivity for a matrix, we can assume without loss 
of generality that its l-marginals are weakly decreasing, that is, they are 
partitions. Then we have 


Theorem 6. Let X be an n-dimensional binary matriz whose 1-marginals are 
partitions. If X is a matrix of uniqueness, then X is a pyramid. 


This follows from the next lemma and Theorem 3. 


Lemma 3. Let X be an n-dimensional binary matrix whose 1-marginals are 
partitions. If X does not have bad configurations of weight 2, then X is a 
pyramid. 


Proof. Suppose X is not a pyramid. Then, there exist k, l € B(q), and 1 € 
i < n, such that k; <li, kj = lj for j #1, X(k) = 0 and X(l) = 1. Since the 
ith marginal a? of X is weakly decreasing, there must exist s, t € B(q), such 
that s; = ki, tj = lj, sj = tj for j # i, X(s) = 1, and X(t) = 0. Let W be the 
matrix defined by W(l) = W (s) = 1, W (k) = W(t) = —1, and W(r) = 0 for 
the remaining points. Then W is a bad configuration for X of weight 2. 




















Example 6. The converse of Lemma 3 is true for n = 2,3 (see Sections 5.4 
and 5.7). However, for n — 4, the converse is no longer true, as the follow- 
ing example from [32] shows: Let X: B(2,2,2,2) —9 {0,1} be the matrix 


that takes value 1 on the points (1,1,1,1), (1,1,1,2), (1, 1,2, 1), (1,2, 1, 1), 
(2, 1,1, 1), (1,1,2,2), (2,2, 1, 1) and value 0 on the remaining points. It is a 
pyramid and the matrix W : B(2,2,2,2) —> (—1, 0, 1} that takes value 1 on 


(1,1,2,2), (2,2, 1, 1), value —1 on (1,2,2,1), (2, 1, 1,2), and 0 on the remaining 
points is a bad configuration of weight 2 for X. 


Combining Theorems 5 and 6, we obtain the following result, for which we 
include an elementary direct proof. 


Theorem 7. Let X be an n-dimensional binary matriz whose 1-marginals are 
partitions. If X is (0, 1)-additive, then X is a pyramid. 


Proof. Suppose X is additive, but not a pyramid. Let f;: [qj] — R, j € [n], 
be the maps satisfying (5.11) in the definition of (0, 1)-additivity. Let k, l, s, 
t, and i be the elements constructed in the proof of Lemma 3. Since X(k) — 0 
and X(l) = 1, we have $75 , fi(kj) < 0 € 255, f;(l); therefore, fi(ki) < 
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fi(li). Similarly, since X(s) = 1 and X(t) = 0, using the identities among the 
coordinates of k, l, s, t, we obtain f;(k;) > fi(l;), which is a contradiction. 
'Therefore, X has to be a pyramid. 














Following constructions can be applied to the plane partitions associated 
to pyramids; see Remark 6. 
Let i: B(q) — B(q) be the involution (1? = 1) defined by 


(Kk) := (m 1— ky... d -1— kn), (5.15) 


and let p: B(q) —> B(q) be the rotation defined by p(k) := (Ko,..., kn, k1). 
Then, given a (0, 1)-matrix X: B(q) — (0, 1}, we define the complement of 
X as the matrix X° given by X*(k) = 1 if, and only if, X(((k)) = 0 and the 
rotation of X as the matrix X" given by X'(k) = X(p(k)). 


Lemma 4. Let X be an n-dimensional binary matriz. Then X is a pyramid, 
resp. matrix of uniqueness, resp. (0, 1)-additive, if, and only if, X* is a pyra- 
mid, resp. matrix of uniqueness, resp. (0, 1)-additive. 


Proof. The equivalence for pyramids is straightforward. The equivalence for 
uniqueness follows from Theorem 3: If W is a bad configuration for X, then 
W defined by W(k) = —W(i(k)) is a bad configuration for X°. Similarly, the 
equivalence for (0, 1)-additivity follows from Theorem 4. 














A similar lemma can be stated and proved for the rotation X* of X. 


5.4 Uniqueness and (0, 1)-Additivity in Dimension 2 


In this section we address the problem of uniqueness and (0,1)-additivity 
for binary matrices of dimension 2. Historically, the existence problem was 
considered first and independently by D. Gale [11] and H.J. Ryser [25, 26], 
see also [4, 18] and the references therein for other proofs. After that, H.J. 
Ryser gave a sufficient condition for uniqueness [26, p. 62]. E. Snapper seems 
to be the first to state explicitly, several years later, that the condition given 
by Ryser was also necessary [27, Theorem 8.1]. More recently, P.C. Fishburn, 
J.C. Lagarias, J.A. Reeds, and L.A. Shepp [9, $3] showed that uniqueness and 
(0, 1)-additivity coincide and that in dimension 2 it is enough to consider bad 
configurations of weight 2. We overview here these results, not only by of their 
own interest, but also because they will be used to prove their generalizations 
to higher dimensions. 
'The two classical versions of uniqueness and existence follow: 


Theorem 8. Let A, u be partitions of N. Then |M*(A, u)| = 1 if, and only 
if, N = p. 


Theorem 9 (Gale-Ryser Theorem). Let A, p be partitions of N. Then 
|M*(A, )| > 0 if, and only if, N = p. 
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Remark 2. The proof presented here for one direction of the Gale-Ryser theo- 
rem, namely that the majorization condition implies the existence of a matrix, 
yields an algorithm for the construction of the matrix that might be new. This 
algorithm depends essentially on the constructive proof of Lemma 2. Given 
partitions A, u such that A’ > u, Lemma 2 yields an algorithm for construct- 
ing a sequence of partitions 


A =v(0) > v(1)>---> v(k) =p (5.16) 


such that v(i) is a T-transform of v(i — 1) for all 1 € i € k. Recall that the 
number of steps in the algorithm is 


k= M Mom. (5.17) 


te (toii) 


For any i, let j, | be such that Tj;jv(i — 1) = v(i). It is shown in the proof 
of Theorem 9 how to construct a matrix in M*(A,r(2)) from any matrix 
y6G-0- Ge) in M*(A, v(i — 1)). The choice of row r where a 1 is shifted 
to the right is irrelevant as long as Ue = ] and yi) =0. 

The difference with other algorithms is that the choice of the columns 
where we shift ones to the right depends not on some matrix but only on 
sequence (5.16). For other algorithms, see [25, 26] as well as [18, pp. 10-14] 
and the references therein. 


In dimension 2, all properties for a binary matrix coincide, namely 


Theorem 10. Let X be a 2-dimensional binary matriz whose 1-marginals are 
partitions. Then the following are equivalent: 

(1) X is a pyramid; 

(2) X has no bad configurations of weight 2; 

(3) X is a matriz of uniqueness; 


(4) X is (0, 1)-additive. 


We conclude this section with proofs of these theorems. 
Let A be a partition and let p = (A), q > A1. We define X(A) = (zi) as 
the matrix of size p x q given by 


zj at fl sj sr, (5.18) 
0, otherwise. 

Then X(A) € M*(A, A"). Observe that we use the same notation for different 
matrices. If q > A1, X(A) will have q — A1 columns of zeros. The same abuse 
of notation is taking place with A'. This flexibility will be useful in the proof 
of Theorem 9. 


Proof (of Theorem 8). Suppose first that X € M*(A, ui) is a matrix of unique- 
ness. Then by Theorem 6, X is a pyramid. This forces A' = u. For the con- 
verse, it is enough to observe that X(A) is the only matrix in M*(A, A’). 
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Proof (of Theorem 9). Suppose first that there is a matrix Y in M*(A, p). 

Let X(A) € M*(A, A’) be of the same size as Y. Then X(A) is obtained from 

Y by shifting, along each row, ones to the left and zeros to the right. Let 

1 < j € (n). Since $5 à}, resp. 55 4 Ht, is the number of ones in the 

first j columns of X (A), resp. Y, then, by the relation between X(A) and Y, 
1 VALE $ us that is, X = p. 

For the converse, let us assume that A’ > p, and suppose that both 
have the same number of coordinates. Then, by Lemma 2, there are par- 
titions v(0),...,v(k) such that v(0) = X, v(k) = p, v(i — 1) > v(i), 
and v(i) is a T-transform of v(i — 1) for all i. We construct the desired 
matrix inductively. First, observe that X(A) € M*(A,v(0)). Let i > 0 and 
assume by induction hypothesis that there is a matrix YC") = (S?) in 
M*(A, v(i — 1)). Then since v (i) is a T-transform of v(i — 1), there are num- 
bers j < [such that v(i) = Ti v(i — 1). Then v(i — 1); > v(i — 1),. There- 
fore, there is some r such that p = ] and yi) = 0. Let YO = (y) 
be obtained from Y U-9 by switching the one and the zero in positions (r, j) 
and (r,l), that is, let 


0, if (s,t) = (r, j), 
W=41, if (3, =(r,), (5.19) 
yi), otherwise. 


Thus, Y is in M*(A,r(i)) We conclude, by induction, that Y“) is in 
M*(A, p). 


Proof (of Theorem 10). Since a pyramid is of the form X(A), (1) > (3) follows 
from Theorem 8. Theorem 3 yields (3) > (2) and Lemma 3 yields (2) = (1). 
The implication (4) => (1) follows from Theorem 7. It remains to prove (1) > 
(4). We apply a similar method to the one used in the proof of Theorem 14. 
Suppose X is a pyramid of size p x q. Let À1,..., Àp denote its row sums (Ay 
could be zero), and let v1,..., v, denote the different values of the row sums 
arranged in decreasing order. Let vp = q+1 > vı and v,,1 = —1 < vr. Finally, 
let do, d,,...,d, be real numbers such that vg > da > Va+1 for all 0 € a € r. 
We now show that X is (0, 1)-additive. Define fi: [p] — R by fili) = X. 
Let j € [q]. Then there is a unique t(j) such that vej) > j > vi(7) 41. Define 
fo: [a] — R by f26) = —diy). Let (i,j) be in [p] x [a]. If XG, j) = 1, 
then j € A; and VEG) € Ai. Thus, fili) + f2(3) = A; — dic) > Xr; — 45) 2 0. If 
X(i,7) =0, then A; < j and à; € Ut(3)4-1- Thus, fali) + fo(j) = Ai — dij) « 
Ai — Ui(j)41 € 0. Therefore, X is (0, 1)-additive. 


























We end this section with an example that shows how the algorithm con- 
tained in the proof of Theorem 9 works. 


Example 7. Let A = (5,5,2,2, 1, 1) and u = (4,3,3,3,1,1, 1). The conjugate 
partition to A is A' = (6,4,2,2,2) and A' > u. We constructed in Example 2 
by means of Lemma 2 a sequence of partitions 
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X =v(0) > v(1) - -> v(4)— p. (5.20) 


The sequence of matrices Y in M*(A,v(2)), 0 € i € 4, obtained in the proof 
of Theorem 9 is 


1111100 1111100 1111100 
1111100 1111010 1111010 
y(902—|1100000 y(02|1100000 y2—.—]|1100000 
— [1100000 — |1100000]|^ = [|1010000]|^ 
1000000 1000000 1000000 
1000000 1000000 1000000 
1111100 1111100 
o H3. yo. Hioid83 
iw 1010000 ire 1010000 (5.21) 
1000000 0000001 
0001000 0001000 


Here Y (9 was constructed as follows: Since v(0) and v(1) differ on the fifth 
and sixth coordinates, there must exist some i such that y = l and yo) = 
0. In this example there are two such it’s, namely, i = 1,2. We chose the 
largest one (in practice, this choice is irrelevant). Proceeding in the same way, 
we obtain Y 2, YS), and Y (9, 


5.5 Minimal Matrices and Uniqueness 


In this section we give characterizations of existence and uniqueness of n- 
dimensional binary matrices in the same spirit as in Theorems 8 and 9. In 
dimension 3, a characterization for existence was given by W.B. Jurkat and 
H.J. Ryser (Theorem 5.5 in [19]) a long time ago. Theorem 12 generalizes their 
result to any dimension. A similar treatment for uniqueness in dimension 3 was 
given by A. Torres-Cházaro and the author [28]. Their proof uses a connection 
due to E. Snapper [27] between matrices and the theory of characters of 
the symmetric group. À combinatorial proof that closely follows the original 
proof in [28] was given by R. Brualdi [5] using ideas from [19]. Theorem 11 
generalizes the result from [28] to any dimension. The proof presented here 
follows the original proof, but the crucial identity (Lemma 7), coming from 
character theory, is proved here combinatorially in the same way it as was done 
in [5] for dimension 3. It should be pointed out that the general philosophy 
of looking at integer matrices for characterizing uniqueness comes from the 
fact that matrices of uniqueness (with weakly decreasing 1-marginals) are 
pyramids (Theorem 6) and the observation that pyramids can be identified 
with integer matrices. The main notion appearing from this approach is that 
of a minimal matrix (to be defined below), which also has applications to 
Kronecker products of characters of the symmetric group [1, 30]. A third 
result of importance is a characterization for minimal matrices (Theorem 13) 
that uses the graph map. This result was first proved in dimension 3 using 
characters of the symmetric group [30, Proposition 3.1]. Here we extend the 
result to any dimension and give a new proof of combinatorial nature. 
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We use the notation from Section 5.3. Let a’ € NJ, i € [n—1], bea vector 
and v € Nj” be weakly decreasing, that is, a partition. Let A be a matrix 
in M(al,...,a"-!) and let ki,..., kg be the elements of B(q) arranged in 
lexicographic order. Then, after arranging the entries of A according to this 
ordering, we define a vector ®(A) :— (A(Ki),..., A(kg)) in Nd. A second 
vector in NI associated to A, called the z-sequence of A and denoted 7(A), is 
the vector formed by the entries of A arranged in weakly decreasing order. For 
any partition p in NZ, we denote by M,(a!,...,@"—!) the set of all matrices 
in M(al,...,a^-1) with z-sequence p. The matrix A is called m-unique if 
there is no other matrix B € M(a!,...,a?-1) with (B) = n(A); A is called 
minimal if there is no other B € M(al,...,a"-!) with «(B) < n(A); and A 
is called hyperpartition of dimension n — 1 or (n — 1)- dimensional partition if 
for all k, | € B(q) such that k; < l;, for i € [n — 1], one has A(k) > AÑ). A 
two-dimensional partition is usually called a plane partition. 

In order to state the characterization for uniqueness we need the following 
construction: Choose any qn > max{ A(k) | k € B(q) ). Then the graph G(A) 
of A is defined by 


’ if 1 < kn < Aor Ego) 


: (5.22) 
0, otherwise. 


Note that the value of qn is unimportant; what really matters is the set of 
all k € B(q) that satisfy G(A)(k) = 1. The graph of A has 1-marginals 


a',...,a"—!, n(A}. So, for any partition p € Nd" of N, there is an injective 
map 
G: Mo(al,...,a"71) — M'(al,...,a"7!, p^). (5.23) 
Example 8. Let 
331 441 432 
de ob eoe act uw dd. Cem 13 vol. (5.24) 
200 200 110 


The three matrices A, B, C are plane partitions. The 7-sequences of A, 
B, and C are, respectively, (3,3,2,2,1,1,1,0,0), (4,4,2,2, 1, 1,1,0,0), and 
(4, 3,3, 2, 1, 1, 1,0, 0). Matrix A is minimal, but not 7-unique [28, p. 447]. The 
first assertion can be checked directly by hand; for the second, just take the 
transpose of A. Matrix B is z-unique, but not minimal. The first assertion can 
be checked by hand; for the second, observe that C has the same 1-marginals 
as B, and *(C) < 7(B). The graph G(B) of B is the matrix from Example 3 
with 1-marginals a! — (9,4,2), a? — (8,5,2), and a? — (7,4,2,2). Note that 
B has l-marginals a! and a? and that (B) is conjugate to a? (zeros here 
are irrelevant). See Example 10 for other properties of C. 


Theorem 11. Let X € M*(al,...,a"-!,v). Then X is a matriz of unique- 
ness if, and only if, X is the graph G(A) of a matriz A € M(a!,...,a^-1) 
that is minimal and m-unique. 
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Corollary 2. Let A be a matrix that is minimal and m-unique. If its 1- 
marginals are partitions, then A is a hyperpartition. 


Example 9. The graph G(D) of the matrix 


55544 
55533 

D= |33110 (5.25) 
21110 
21000 


is the matrix of uniqueness in Example 5, which has 1-marginals 
a! = (23,21,8,5,3), a” = (17,15,12,9,7), and a? = (20, 14,12,8,6). (5.26) 


The 1-marginals of D are a! and a?; its 7-sequence is precisely the partition 
conjugate to a?. Theorem 11 implies that D is minimal and 7-unique. The 
graph of a matrix is customarily depicted by stacking unit cubes as shown in 
Fig. 5.2. 





Fig. 5.2. The graph of pyramid D. 


Remark 3. It should be noted that, while the property of being a matrix of 
uniqueness is symmetric in a!,..., a", the condition given in Theorem 11 is 
only symmetric in al,...,a?-!: If X is a matrix of uniqueness and its 1- 
marginals are partitions, then X is a pyramid and the matrix A such that 
X — G(A) is obtained by projecting X along the nth direction (the direction 
of the nth vector of the canonical basis of IR"). In fact, a similar result holds 
if we choose to project X in the ith direction for any à € [n]. 


Theorem 12. The set M*(al,...,a"-!,v) is nonempty if, and only if, there 
is a matriz A € M(a',...,a"~') such that «(A) = v. 


Theorem 13. Let A € M,(a',...,a"~1). Then A is minimal if, and only if, 
the map 
G: My(al,...,a"-!) — M*(al,...,a"7!, p!) (5.27) 


is bijective. 
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The rest of the section is devoted to the proofs. We start with a construc- 
tion that is related to the graph of a matrix: The projection Pr(.X) of a binary 


matrix X with 1-marginals a',...,a"~1, v is defined by 
» dn 
Pr(X)(K) := XO X (kr. ka-15) ’ (5.28) 
j=l 


for all k € B(q). Note that Pr(X) is in M(a!,...,a"~!). So, we have a map 
Pr: M*(a!,...,a"71, y) — M(al,...,a^71). (5.29) 


The z-sequence of the projection satisfies v (Pr(X))' = v (the reasoning is 
similar to the one in the proof of Theorem 9). 

A variation of the notion of pyramid will be needed below. Let i € [n]. 
An n-dimensional binary matrix X is called pyramid with respect to the ith 
direction if for all k, l € B(q) such that k; < l; and kj = lj for j # i, the 
condition X(l) = 1 implies X(k) = 1. Then a pyramid is a pyramid with 
respect to the ith direction, for all i. 

The following properties of G and Pr are easy to check. 


Lemma 5. Let A € M(a!,...,a?-1) and X € M*(al,...,a"-!,v). Then 

(1) A is a hyperpartition if, and only if, G(A) is a pyramid. 

(2) If X is a pyramid, then Pr(.X) is a hyperpartition. The converse does 
not hold in general. 

(3) Pr(G(A)) = A. 

(4) X is in the image of G if, and only if, X is a pyramid with respect to 
the nth direction. In this situation, G(Pr(X)) = X. 


Lemma 6. There is a bijection (the union on the right is disjoint) 


8: M'(a,...,a^) — U M*(9(A), a"). (5.30) 


Proof. For X € M*(al,...,a"), let A = Pr(X) and Y = (yij) be defined by 


i 5.31 
á n otherwise. 


Define (X) := Y. Then Y € M*(9(A), a"). It is easy to show that 6 is a 
bijection. 














Lemma 7. The following identity holds: 


IM*(al,...,a"-!,p)| = 5 |M* (7 (B), v)| . (5.32) 
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Proof. By Lemma 6, the left-hand side of the identity equals 


5 |M*(®(B),v)|. (5.33) 


But for any B € M(a',...,a”~"), one has |M*(®(B),v)| = |M*(z(B),v)|; 
and, by Theorem 9, |M*(a(B),v)| Z 0 if, and only if, 7(B)’ > v. Then the 


claim follows. 














Proof (of Theorem 11). Let X € M*(al,...,a"-!,v) be a matrix of unique- 
ness. Lemma 7 implies that 


5 |M*(m(B),v)|=1. (5.34) 


Then there is a matrix A € M(a!l,...,a@"~+) such that |M*(r(A),v)| = 1; 
therefore, by Theorem 8, (A)! = v. Moreover, for any B € M(a!,...,a"~?), 
different from A, one has |M*(z(B), v)| = 0. This implies that A is minimal 
and z-unique. Since G(A) € M*(a',...,a"~+, v), our uniqueness assumption 
forces X = G(A). 

Conversely, suppose X is the graph of a matrix A € M(al,...,a 
that is minimal and z-unique. Let v = «(A)'. Since A is minimal and 7- 
unique, Theorems 8 and 9 imply that Eq. (5.34) holds. Then, since X € 
M*(a!,...,a*-1, v), Lemma 7 implies that X is a matrix of uniqueness. 


Se 














Proof (of Corollary 2). By Theorem 11, G(A) is a matrix of uniqueness whose 
1-marginals are partitions. The claim follows from Theorem 6 and Lemma 5. 




















Proof (of Theorem 12). It is immediate from Lemma 7. 


Proof (of Theorem 13). Suppose first that G is not bijective. Then there is a 
matrix X € M*(a!,...,a"-1, p^) that is not in the image of G. Therefore, by 
Lemma 5, X is not a pyramid with respect to the nth direction, so there are k, 
l such that kj = lj, for 1 < j < n, kn < ln, X(k) = 0, and X(I) = 1. Let Y be 
obtained from X by switching the zero in position k and the one in position 
l, that is, Y (k) = 1, Y (I) = 0 and Y (m) = X(m) for m £ k, l. Then Y has 
1-marginals a’,...,a"~*, b, where by, = pz, +1, bi, = pj, — 1, and bj = p; if 
j Æ kn, ln. The vector b might not be a partition, but v(b) > p’ (this can be 
seen directly or by means of Lemma 1). Moreover, Pr(Y) € M(a',...,a”~! 
and m(Pr(Y)) = m(b) > p’. Therefore, by (5.8), m(Pr(Y)) < p = «(A). 
Thus, A is not minimal. 

Conversely, suppose A is not minimal. Then there is a matrix B in 
M(a,...,a"-!) such that m(B) < n(A) = p. Let A = «(B). Then G(B) 
is in M*(al,...,a^-1, A). By (5.8), A’ > p’. We can, then, as in the proof 
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of Theorem 9, apply Lemma 2 and shift ones up in G(B), along the nth di- 
rection, to obtain a matrix Y € M*(a',...,a"~1, p^). (We could also have 
used Lemma 6 and apply the idea in the proof of Theorem 9 to obtain a 
matrix Z in M*(®(B), p’) from 3(G(B)) in M*(®(B),X’). Then Y would be 
B-1(Z).) The matrix Y is not a pyramid with respect to the nth direction. 
Then Lemma 5 implies that G is not a bijection. 














5.6 Additivity for Integer Matrices 


In this section we follow the same line of thought as in the last section and give 
a condition for an integer matrix A that is equivalent to (0, 1)-additivity of its 
graph G(A). This new condition provides a more economic way of showing that 
a pyramid is (0, 1)-additive. The results presented here are taken from [32]. 

We use the notation from Section 5.3. Let a!,...,a"~!, v be as in Sec- 
tion 5.5. A matrix A € M(a!,...,a"~+) is called additive if there are vectors 
x’ c RË, i € [n — 1], such that for all k, T € B(q) the following condition 
holds: 


n—1 n—1 
A(k)>AQ = Mis >Y ri. (5.35) 
{=I i=l 


Theorem 14. Let X € M*(al,...,a"-!,v). Then X is (0,1)-additive if, and 
only if, X is the graph G(A) of an additive matriz A in M(al,..., a"). 


Example 10. The matrix C from Example 8 is additive: Let æ! = (7,2,0) 
and x? = (6,3,0). Then the matrix of sums (s;;) :— (xl + æ?) satisfies 
condition (5.35). Theorem 14 implies that the graph of C is (0, 1)-additive, 
and Corollary 3 below implies that C is minimal and z-unique. 


Example 11. The matrix D from Example 9 is not additive. This follows from 
Theorem 14, since its graph is not (0, 1)-additive (see Example 5). 


Proof (of Theorem 14). Suppose first that X € M*(al,...,a"-1,v) is (0,1)- 
additive. Then there are maps fi: [qi] — R, 1 € i < n, satisfying condi- 
tion (5.11) in Section 5.3. Moreover, since v is a partition, we can use the 
same reasoning from the proof of Theorem 7 to conclude that X is a pyramid 
with respect to the nth direction, and therefore, by Lemma 5, X is the graph 
G(A) of some matrix A € M(al,...,a"-!). We will show that A is additive. 
Let x = fi(j) for i € [n — 1] and j € [qi]. Let k, l € B(q) and suppose that 
A (k) > A(T). Let kn 2l, = A(D) +1, k = (ki... kn), and E = (l1,... pln). 
Then, since X = G(A), we have X(k) = 1 and X(l) = 0. Thus, by (5.11), 
Moa Silki) > 0 > 3554 fi(li). Since kn = ln, condition (5.35) holds. 
Conversely, if X = G(A), for some additive matrix A € M(a!,...,a"- 1). 
Then there are vectors z^ € R% satisfying condition (5.35). For i € [n — 1], 
let f;(j) = a5. The definition of f, is more involved. Let v1,...,Um be the 


5 Uniqueness and Additivity for Binary Matrices 101 


different values taken by the entries of A, arranged in decreasing order. Let 
Vo = qn + 1 > v and Um4i = —1 < Um. For each t € [m], let 


n—1l 
pı = min J Th, 
i=1 





s@=n} (5.36) 
and 
n—i 
ij; = max | 5 ri, 
i=1 


Let yo and Ym+ı be such that yo > Yı and Ym+1 < Ym. Then it follows 
from (5.35) that 





aga) (5.37) 


Qo > Yı 2 p1 > Y2 2 +: > Ym-1 > Vm = Ym > Ymi. (5.38) 


Choose numbers d;, 0 € t € m such that y+ > d; > wVi44 (they could be 
rationals). Now, for any j € [qn], there is a unique 0 € t(j) € m such that 
Ut(4) >j > Ut(3)4-1 . (5.39) 


Let f4(j) = —dij)j. Now we proceed to check condition (5.11). Let k € B(q). 
If X(k) = 1, then kn < A (k) = v, for some s. Then there is some r > s such 
that v, > kn > v,41. Therefore, 


n n—1 
XO filka) = au, — da. 
i=l i=1 
> ~s—dr > yr- dr >0. (5.40) 
Similarly, if X(k) = 0, then k, > A (k) = v, for some s. Then there is 
some r < s such that v, > kn > vp+1. Therefore, 


n 


n-1 
NO filki) = Mu — dicen) 
t=1 


i=1 


< Ws — dr X Ws — ds—1 <0. (5.41) 





Therefore, X is (0, 1)-additive. 











Corollary 3. If A is an additive matriz, then A is minimal and r-unique. If, 
moreover, the 1-marginals of A are partitions, then A is a hyperpartition. 


Proof. Let A be an additive matrix. Then, by Theorem 14, G(A) is (0,1)- 
additive. Theorem 5 implies G(A) is a matrix of uniqueness. Finally, by The- 
orem 11, A is minimal and z-unique. The last claim follows from Corollary 2. 
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Example 12. Additivity can be more easily shown as (0, 1)-additivity. For ex- 
ample, the graph of the q x q matrix 


| q q—1 q—-2 `. 2 i] 


gd ge Ves owed 
ipee. se bk rw du 

A= . . Lo wt tci (5.42) 
2 1 0 >- 0 0 
1 0 0 00 


is the matrix X in Example 4, which is (0, 1)-additive. Additivity of A follows 
directly by taking xt = x? = (q—1,q—2,...,1,0). Then Theorem 14 provides 
a simpler proof of the (0, 1)-additivity of X. 


5.7 Uniqueness and (0, 1)-Additivity in Dimension 3 


In this section we present some results about three-dimensional (0, 1)-matrices 
and two-dimensional integer matrices. We start by showing that pyramids are 
characterized by the absence of bad configurations of weight 2 [29, 83], [32, 83]. 
Then we prove (Theorem 15) a result analogous to Theorem 10 that shows 
the equivalence among being a pyramid, uniqueness, and (0, 1)-additivity for 
matrices of size 2x q x r [32]. For this we need to show that any plane partition 
of size 2 x q is additive (Lemma 8). We present a new, shorter proof of this 
result than the one given in [32, 84]. We also mention the classification of bad 
configurations of weight 3 for pyramids from [29]. We conclude with several 
examples and some comments about the weight of bad configurations needed 
to determine if a pyramid is a matrix of uniqueness. 
Let A, u, v be partitions of some natural number N. 


Proposition 1. Let X € M*(A,u,v). Then X is a pyramid if, and only if, 
X has no bad configurations of weight 2. 


Proof. One implication is Lemma 3. For the converse, suppose X has a bad 
configuration W of weight 2. Then W takes value 1 on two points, say k, l, 
and value —1 on two points, say s, t. If these four points lie on a plane parallel 
to a coordinate plane, then they must be the vertices of a rectangle; besides, 
k and l must be diagonally opposite. A simple case-by-case analysis shows 
that X is not a pyramid. If k, l, s, t are not contained in a plane parallel 
to a coordinate plane, then k and l must be diagonally opposite vertices of a 
parallelepiped and s, t must be a second pair of diagonally opposite vertices 
of the same parallelepiped. Again, a simple case-by-case analysis shows that 
X is not a pyramid. 














Lemma 8. Every plane partition of size 2 x q is additive. 
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Proof. Let A = (aij) be a plane partition of size 2 x q (q € N). Since there 
is a plane partition B = (bij) of size 2 x q whose entries are all different and 
satisfies aj; > aj; = bij > ber, we can assume without loss of generality that 
A does not have two equal entries. Then we proceed by induction on q. The 
case q = 1 is trivial. Suppose then q > 1. The matrix A obtained from A 
by deleting the first column is a plane partition of size 2 x (q — 1). By the 
induction hypothesis, A is additive. Then there exist real numbers z1, £2, 
y2...., yg Such that for all i, k € [2], j, | € {2,...,q}, the condition 


Qij > Akl > Ti + Yj > Tk +y (5.43) 


holds. We assume for simplicity that z2 = 0 (this can always be done). Let 
s = max{j € [q] | aij > azı} (since ai1 > a21, s is well defined). If s = 1, 
then a21 > a1», so we choose yı to be any number bigger than xı + ye. If 
1 < s < q, then ais > a21 > Qis+1, SO we choose yı such that 71 + ys > 
yı > max{zı + Ys+1;, ya]. If s = q, then aig > a21, so we choose yı such 
that ri + Yq > yi > ye. In the three cases, additivity follows easily from the 
induction hypothesis. 














Then by Corollary 3 we have 
Corollary 4. Every plane partition of size 2 x q is minimal and n-unique. 


Theorem 15. Let X € M*(A, p, v). If A has length 2, the following are equiv- 
alent: 

(1) X is a pyramid; 

(2) X has no bad configurations of weight 2; 

(3) X is a matriz of uniqueness; 

(4) X is (0, 1)-additive. 


Proof. (1) = (4). Let X be a pyramid. Then, by Lemma 5, there is a plane 
partition A of size 2 x q (q is the length of jj) such that X is the graph of A. 
By Lemma 8, A is additive, and by Theorem 14, X is (0, 1)-additive. Finally, 
(4) = (3) is Theorem 5, (3) = (2) is Theorem 3, and (2) = (1) is Lemma 3. 














Example 13. 'Theorem 15 is no longer true if A has length greater than 2. Let 
X be the graph of matrix A in Example 8; then X is a pyramid that is not a 
matrix of uniqueness [29, 83]: Let W: B(3,3,3) — (—1,0,1) be the matrix 
that takes value 1 on (1,2,3), (2,3,1), and (3,1,2); value —1 on (1,3,2), 
(2, 1,3), and (3,2, 1); and value 0 on the remaining points. Then W is a bad 
configuration for X of weight 3. 


Bad configurations of weight 3 for pyramids have been classified. Any 
such bad configuration of weight 3 is essentially the bad configuration of Ex- 
ample 13 or its negative. The following theorem and its corollary are taken 
from [29, 85], where proofs can be found. 
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Theorem 16. Let W: B(p,q,r) — {-1,0,1} be a bad configuration of 
weight 3 for some pyramid. Then there exist integers 


l<a<b<c<p,l<d<e<f<qandl<g<h<i<r (5.44) 


such that W or —W takes value 1 on (a,e,i), (b, f, g), (c, d, h), value —1 on 
(a, f, h), (b, d, i), (c, e, g), and value 0 on the remaining points. 


Corollary 5. Let X be the graph of a plane partition A. Then X has a bad 
configuration of weight 3 if, and only if, either A or its transpose contains a 
3 x 3 submatrix B = (bij) satisfying 


b23 < b32 < b31 < bi3 < b12 < 031. (5.45) 
Example 14. The graph X = G(E) of the matrix 


55544 
55533 
33110 
E = 21110 (5.46) 
21110 


21000 


has no bad configurations of weight 3 [29, §3]: Suppose X contains such a 
bad configuration. Then, by Corollary 5, E or its transpose contains a 3 x 3 
submatrix B = (b;;) satisfying (5.45). These inequalities imply that the three 
rows and the three columns of B are different. Then B or its transpose is a 
submatrix of matrix D in Example 9 (since D is obtained from E by deleting 
row 5). This is a contradiction because the graph of D is a set of unique- 
ness. However, E has a bad configuration of weight 6, namely W, where 
W: B(6,5,5) — 1(-1,0,1] is the matrix that takes value 1 on (1,5,4), 
(2,3,5), (3,2,3), (4,4, 1), (5,4,1), (6,1,2), value —1 on (1,4,5), (2,4,4), 
(3,5,1), (4,2,2), (5,1,3), (6,3,1), and value 0 on the remaining points. 


Remark 4. Example 14 shows how in dimension 3 bad configurations of weight 
3 are not enough to determine whether a pyramid is a set of uniqueness. It 
is an open problem whether there is a finite upper bound (at least 4 by this 
example) on the weight of bad configurations that one need not go beyond 
to determine whether a pyramid of arbitrary size (number of ones) is a set of 
uniqueness; compare with [9, §5]. It seems likely that no finite upper bound 
exists. A similar result was proved in [17], where it was shown that when there 
is no upper bound on the weight of bad configurations, one needs to determine 
whether a 3-dimensional (0, 1)-matrix is a matrix of uniqueness with respect 
to their 2-marginals or line sums. 


Remark 5. Not all minimal matrices are plane partitions. The simplest exam- 
ple of a minimal matrix that is not a plane partition is a permutation matrix. 
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A classification of all minimal matrices of size 2 x q was achieved in [31]. A 
similar classification for minimal matrices or even for additive matrices of size 
3 x q seems much more difficult; compare, for example, the results in [7]. 


Remark 6. 'There are constructions for plane partitions analogous to the com- 
plement and rotation of a pyramid. A result similar to Lemma 4 can be stated 
for plane partitions. This is of interest in the applications to Kronecker prod- 
ucts. 


5.8 The Geometry of Real-Minimal Matrices 


In this section we extend the notions for matrices with nonnegative inte- 
ger entries given in Sections 5.5 and 5.6 to matrices with real entries. For 
a matrix A with real entries, we give characterizations involving the permu- 
tohedron determined by A, where A can be minimal, real-minimal, and/or 
7-unique. In particular, we obtain a new characterization for a binary matrix 
X to be a matrix of uniqueness (Corollary 10). We also show that minimal and 
real-minimal matrices appear as optimal solutions of quadratic transportation 
problems. In fact, the optimization problem from Proposition 3 has been used 
by H. Díaz-Leal, J. Martínez-Bernal, and D. Romero to compute the dimen- 
sion of the variety of flags fixed by a nilpotent endomorphism [8]. The results 
presented here were obtained by S. Onn and the author for 2-dimensional 
matrices in [23]; their generalization to any dimension is easy, but needs some 
care. 

We use the notation from Section 5.3. Let M(q) denote the set of all 
matrices with real entries of size q1 X --: X qn. As in Section 5.5, we associate 
to A € M(q) two vectors in R?. The first uses the elements of B(q) arranged 
in lexicographic order &,..., kq and is defined as 


(A) := (A(k1),..., A(Kq)) . (5.47) 


Then ®: M(q) — R! is a linear isomorphism. The second vector associated 
to A, called the z-sequence and denoted by (A), is the vector formed by 
the entries of A arranged in weakly decreasing order. We proceed to extend, 
to matrices with real entries, the definitions given in previous sections for 
matrices with integer entries. Let A € F(a!,...,a”). Then A is called 7-unique 
if there is no other B € F(a!,...,a") such that m(B) = 7(A); A is called 
real-minimal if there is no other B € F(al,..., a?) such that m(B) < n(A); A 
is called additive if there are vectors æt € R%, 1 < i € n, such that for all k, 
l € B(q), condition (5.35) in Section 5.6 holds; and A is called hyperpartition 
if for all k, l € B(q) such that k; < l;, for all 1 € i € n, one has A(k) > A(l). 
Clearly, any real-minimal matrix in M(a!,...,a”) is minimal. The converse 
is false, as the following example shows. 


Example 15. The matrix A from Example 8 is minimal, but not real-minimal. 
A matrix B with the same 1-marginals as A and such that (B) < 7(A) can 
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be constructed by taking a convex combination of A with its transpose, that 
is, B=cA+(1—e)A!' for some 0 < e < 1. 


It is important to note that if a matrix A is additive, then any other matrix 
B such that B(k) > B(l) if, and only if, A(k) > A(J) for all k, l, is additive. So, 
additivity for a matrix A (and by Theorem 19 also real-minimality) depends 
only on the connected region of the arrangement of hyperplanes 


in RR? in which A lies. 
Finally, the permutohedron determined by A is defined by 


P(A) := P(®(A)) . (5.49) 

Note that 7(A) is a vertex of P(A). 
Theorem 17. Let A € F(a',...,a"). Then A is real-minimal if, and only if, 
P(A) n ®(F(a',...,a”)) = {®(A)}. (5.50) 


Proof. Suppose first that there is some B € F(al,...,a"), B Æ A, such that 
®(B) € P(A). Then, by Theorem 2, 7(B) < -(A). Since A # B, there is 
some 0 < € < 1 such that (£A + (1 — £) B) Z 1(A). Let C = £A + (1 — €) B; 
then, by convexity, (C) is in PLA) n ®(F(a!,...,a”)). Therefore, Theorem 2 
implies that v (C) < (A); so, A is not real-minimal. Conversely, if A is not 
real-minimal, then there is some B € F(a!,...,a") such that m(B) < m(A). 
Thus, B Z A, and by Theorem 2, (B) is in P(A) n &(F(a!,...,a")). 














Note that if A € T(a!,..., a"), then P(A) is in the first orthant. Therefore, 
P(A) n &(F(al,...,a")) = P(A) n &(T(al,...,a")). (5.51) 

'Thus, in this case, we have 
Corollary 6. Let A € T(a',...,a"). Then A is real-minimal if, and only if, 
P(A) n $(T(al,...,a"?)) = {B(A)} . (5.52) 


Corollary 7. Let A € F(al,...,a"). Then A is real-minimal if, and only if, 
there is a hyperplane H C R? that contains ®(F(a',...,a”)) and satisfies 
P(A)N H = {®(A)}. 


Corollary 8. If A € F(al,...,a") is real-minimal, then A is v-unique. 
The following lemma is analogous to part (ii) of Theorem 1 in [28]. 


Lemma 9. Suppose a!,...,a" are partitions. If A € F(al,...,a") is real- 
minimal, then A is a hyperpartition. 
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Proof. Suppose A is not a hyperpartition. Then there are k, l € B(q) and 
1 € i € n, such that k; < li, kj = lj, for j # i and A(k) < A(l). Since 
ai, > ai, there must exist s, t € B(q) such that s; = ki, ti = li, sj = tj, 
for j Z i and A(s) > A(t). Choose £ > 0 such that A(k) +€ < A(l) — £ and 
A(s) — € > A(t) +€. Let B be defined by B(k) = A(k) +e, B(l) = A(l) — e, 
B(s) = A(s)—e, B(t) = A(t)+e and B(m) = A(m) for m Z k, l, s, t. Then, 
since (A(D), A(k)) > (A(D) —e, A(k)+e) and (A(s), A(t)) > (A(s)—e, A(t)+¢), 
by Lemma 1, «(A) > «(B). But B € F(a!,...,a"), then A is not real- 
minimal. 




















Theorem 18. Let A € M(al,...,a") with m-sequence p. Then A is minimal 
if, and only if, 


P(A) ®(T(a!,...,a")) NZI = (&(B)| BeM,(al,...,a")). (5.53) 


Proof. This is similar to the proof of Theorem 17. Note that the right-hand 
side is always contained in P(A) n (T(al,...,a")) n Z4. Suppose, first, that 
there is some B € M(al,...,a") with @(B) € P(A) and (B) Z n(A). Then, 
by Theorem 2; *(B) < 7(A), thus, A is not minimal. Conversely, if A is not 
minimal, there is some B € M(a!,...,a") such that m(B) < 7(A); thus, 
(B) € P(A) N ®(F(a',...,a”)) n Z4, and «(B) Z n(A). 














Corollary 9. Let A € M(al,...,a"). Then A is minimal and r-unique if, 
and only if, 
P(A) n $(T(al,...,a"?)) OZI = ($(A)) . (5.54) 


'This corollary yields a new characterization, via Theorem 11, for a binary 
matrix X to be a matrix of uniqueness. Recall the definition of the projection 
of X after Theorem 13. 


Corollary 10. Let X be a pyramid with 1-marginals a!, a?,..., a"^*!, and 
let A be the projection of X. Then X is a matriz of uniqueness if, and only 


if, 
P(A) n &(T(a!,...,a")) nZ? = (&(A)) . (5.55) 


Proposition 2. Let A* be the optimal solution to the problem 
. 2 
mun Jk eBla) A(k) , (5.56) 
subject to A € T(a!,...,a?). 
Then A* is real-minimal. 


Proof. If A* were not real-minimal, there would exist B € T(a!,...,a”) such 
that m(B) < v(A*). Then, by Rado's theorem, B € P(A*). Since the vertices 
of P(A*) are all contained in the sphere S with center in the origin and radius 
V 24 A* (k)^, and since B is not a vertex of P(A*), B is in the interior of S, 
contradicting the optimality of A*. 
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The same proof yields the following: 


Proposition 3. Let A* be an optimal solution to the problem 


: 2 
nun pa €B(q) A(k) ’ (5.57) 
subject to Ae M(al,...,a?). 


Then A* is minimal. 


Remark 7. Propositions 2 and 3 still hold if 57, A(k)? is substituted by any 
strictly Schur-convex function defined on the first orthant. The proof is essen- 
tially the same as the one we gave above and follows from the very definition 
of Schur-convexity. See [20, Chapter 3] for many examples of Schur-convex 
functions and their applications. 


Remark 8. A particular case of an algorithm due to M. Minoux [21] can be 
used to obtain, in polynomial time, an optimal solution to the optimization 
problem in Proposition 3. Therefore, there is a polynomial-time algorithm that 


produces a minimal matrix in M(a!,...,a") for any 1-marginals al,...,a". 


5.9 Real-Minimal and Additive Matrices 


In this section we show the equivalence of real-minimality and additivity. This 
gives a description of additivity in terms of majorization, providing, in this 
way, a new approach to this notion. This also makes more transparent the re- 
lation between uniqueness and (0, 1)-additivity for a binary matrix and yields 
a new proof that (0,1)-additivity is sufficient for uniqueness (Theorem 5). 
The results presented here were obtained by S. Onn and the author for 2- 
dimensional matrices in [23]; their generalization to any dimension is easy, 
but requires some care. 


Proposition 4. Let A € F(al,...,a"), a = m(A),and a, = ®(A) for some 
permutation c. Then A is real-minimal if, and only if, there is some vector 
n € R1? such that 

(1) n is orthogonal to ®(F(a',...,a”)). 

(2) For any transposition T adjacent relative to a, one has the inequality 
(Nn, aro — as) > 0. 


Proof. Suppose first that A is real-minimal. Then, by Corollary 7, there is 
a hyperplane H containing $(F(al,...,a")) such that P(ag;) n H = {ag}. 
Therefore, there is a nonzero vector n, orthogonal to H, such that for all 
x € P(a,), © # Gz, one has (n, 2 — as) > 0. Then (1) and (2) hold. For 
the converse, suppose that there is some vector n € IR? satisfying condi- 
tions (1) and (2). Let H be the hyperplane orthogonal to n containing as. 
Since, by Theorem 1, the vertices of P(a,) adjacent to a, have the form 
arcs for some transposition 7 adjacent relative to a, condition (2) implies 


5 Uniqueness and Additivity for Binary Matrices 109 


that (n,z — as) > 0 for all x € P(a,) such that x # as. Therefore, 
P(a,) H = {as}. Condition (1) implies that ®(F(a',...,a”)) C H; there- 
fore, P(A) n ®(F(a',...,a”)) = {as}. Then Theorem 17 implies that A is 
real-minimal. 














The set F(a!,...,a”) is just the set of solutions of a system of linear 
equations. We describe below the matrix of coefficients of this system. In 
what follows we adopt the common convention that when a vector a with 
k coordinates is used in a matrix equation, x will denote the corresponding 
matrix of size k x 1 and a! will denote the corresponding matrix of size 1 x k. 
Let 

1 1 ,2 2 


M := (Gy yap 05,5015 5 Agaro AT e Aga s (5.58) 


and let p = qı +: -+qn. We define a matrix T of size p x q as follows: Its rows 
will be indexed by the set of pairs (i, j) such that 1 < i < n and 1 < j € qi; 
they are arranged in lexicographic order. In practice, the pair (i, 7) corresponds 
to the coordinate a^ of m. The columns of T' will be indexed by the elements 


j 
of B(q) arranged in lexicographic order. Then T' is defined by 


T6249 = dt sie 650) 

Then F(a',...,a”) equals the set of matrices A satisfying the equation 
T&(A) 2 m. (5.60) 
For any vector d (apio dioe dose ere, We form a 


matrix Aæ: B(q) — R defined by Az(k) :— $7, z4,- The entries of A; are 
just the numbers appearing in condition (5.35) in the definition of additivity. 
Then the following matrix equation holds for any a: 


a'T = (Az). (5.61) 
In particular, x'T is orthogonal to ®(F(a',...,a")). 


Theorem 19. Let A € F(a!,...,a"). Then A is real-minimal if, and only if, 
A is additive. 


Proof. Let a = n(A). Then ®(A) = a, for some permutation c. Suppose 
A is real-minimal; then by Proposition 4 there is a vector n € R4 satisfy- 
ing conditions (4.1) and (4.2). It follows from the proof of this proposition 
that (n,z — ac) > 0 for all x € P(a,), x £ ag. Since n is orthogonal to 
$(F(al,...,a")), n is in the row space of T. Therefore, there is a vector x 
such that —n = z'T or n = —9(A,). In order to prove that A is additive, 
we take k, | € B(q) such that A(k) > A(l). We have to show, by condi- 
tion (5.35), that Ag(k) = $7; zi, > Puri, = Ae(l). Suppose that, under 
the identity a, = ®(A), A(k) corresponds to as(s) and A(I) to agi). Then 
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Ns = —A,(k) and n, = —A;(l). Let 7 = (o(s) o(t)). Then (n, aro — ac) > 0. 
But since 


(n, arto — às) = Ns(Ao(t) ze üs(s)) + Nt(Ao(s) S s(t)) 


As(k)[A(k) — A(D] — Az (1)[A(k) — AQ)] , (5.62) 


we conclude that Az(K)[A(Kk)— A(D)] > Az(D)[A(&)— A(D)]. But if A(k) > A(D), 
then A,(k) > A,z(l). Therefore, A is additive. 
The converse is similar. Assume that A is additive; then there are real vec- 


tors z3,..., æ”, satisfying condition (5.35). To prove that A is real-minimal, 
we use the equivalence given in Proposition 4. Let z = (xj) and n = —g-T. 
then n is in the row space of T, so n is orthogonal to &(F(al,...,a")). Let 


T be an transposition adjacent relative to a. Let k, l € B(q) be such that un- 
der the identity ®(A) = as, entry A(k) corresponds to a;(,; and entry A(I) 
corresponds to a5(;. Without loss of generality, we assume that A(k) > A(I). 
Then, by condition (5.35), we have As(k) > A,(l). Then identity (5.62) im- 
plies (n, aro — a5) > 0. The claim follows. 














Proof (of Theorem 5). Let X be a (0,1)-additive matrix. Assume, without 
loss of generality, that its 1-marginals are partitions. Then, by Theorem 14, 
X is the graph G(A) of an additive matrix A. Theorem 19 implies that A 
is real-minimal, therefore A is minimal, and, by Corollary 8, A is z-unique. 
Finally, Theorem 11 implies that X is a matrix of uniqueness. 














Acknowledgments 


Supported by DGAPA UNAM, Grant No. IN-111203, and by Consejo Na- 
cional de Ciencia y Tecnología, Mexico, Grant No. 47086-F. I would also like 
to thank Pedro David Sánchez Salazar for providing the figures for this chap- 
ter. 


References 


1. Avella-Alaminos, D., Vallejo, E.: Kronecker products and RSK-correspondences 
for 3-dimensional matrices. In preparation. 

2. Billera, L.J., Sarangarajan, A.: The combinatorics of permutation polytopes. In: 

Billera, L.J., Greene, C., Simion, R., Stanley R.P. (eds.), Formal Power Series 

and Algebraic Combinatorics 1994, DIMACS Series in Discrete Mathematics 

and Theoretical Computer Science, Vol. 24, AMS, Providence, RI, pp. 1-23 

(1996). 

Brylawsky, T.: The lattice of integer partitions. Discr. Math., 6, 201—219 (1973). 

4. Brualdi, R.A.: Matrices of zeros and ones with fixed row and column sum vec- 
tors. Lin. Algebra Appl., 33, 159—231 (1980). 

5. Brualdi, R.A.: Minimal nonnegative integral matrices and uniquely determined 
(0,1)-matrices. Lin. Algebra Appl., 341, 351—356 (2002). 


eo 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 
25. 


26. 


27. 


5 Uniqueness and Additivity for Binary Matrices 111 


Brunetti, S., Del Lungo, A., Gerard, Y.: On the computational complexity of 
reconstructing three-dimensional lattice sets from their two-dimensional X-rays. 
Lin. Algebra Appl., 339, 59-73 (2001). 

De Loera, J., Onn, S.: The complexity of three-way statistical tables. SIAM J. 
Comput., 33, 819-836 (2004). 

Díaz-Leal, H., Martínez-Bernal, J., Romero, D.: Dimension of the fixed point 
set of a nilpotent endomorphism on the flag variety. Bol. Soc. Mat. Mexicana 
(3), 7, 23-33 (2001). 

Fishburn, P.C., Lagarias, J.C., Reeds, J.A., Shepp, L.A.: Sets uniquely deter- 
mined by projections on axes II. Discrete case. Discr. Math., 91, 149-159 (1991). 
Fishburn, P.C., Shepp, L.A.: Sets of uniqueness and additivity in integer lat- 
tices. In: Herman, G.T., Kuba, A. (eds.), Discrete Tomography: Foundations, 
Algorithms, and Applications, Birkhäuser, Boston, MA, pp. 35-58 (1999). 
Gale, D.: A theorem of flows in networks. Pacific J. of Math., 7, 1073-1082 
(1957). 

Gritzmann, P., de Vries, S.: On the algorithmic inversion of the discrete Radon 
transform. Theor. Comp. Science, 281, 455-469 (2001). 

Gaiha, P., Gupta, S.K.: Adjacent vertices on a permutohedron. SIAM J. Appl. 
Math., 32, 323-327 (1977). 

Grünbaum, B.: Convex Polytopes, 2nd edition. Springer, Berlin, Germany 
(2003). 

Hardy, G., Littlewood, J.E., Pólya, G.: Inequalities, 2nd edition Cambridge 
Univ. Press, Cambridge, UK (1952). 

Klee, V., Witzgall, C.: Facets and vertices of transportation polytopes. In: Math- 
ematics of Decision Sciences, Part I. Amer. Math. Soc., Providence, RI, pp. 257— 
282 (1968). 

Kong, T.Y., Herman, G.T.: On which grids can tomographic equivalence of 
binary pictures be characterized in terms of elementary switching operations? 
Int. J. Imaging Syst. Technol., 9, 118-125 (1998). 

Kuba, A., Herman, G.T.: Discrete tomography: A historical overview. In: Her- 
man, G.T., Kuba, A. (eds.), Discrete Tomography. Foundations, Algorithms, 
and Applications. Birkhauser, Boston, MA, pp. 3-34 (1999). 

Jurkat, W.B., Ryser H.J.: Extremal configurations and decomposition theorems 
I. J. Algebra, 8, 194—222 (1968). 

Marshall, A.W., Olkin, I.: Inequalities: Theory of Majorization and Its Applica- 
tions. Academic Press, New York, NY (1979). 

Minoux, M.: Solving integer minimum cost flows with separable convex cost 
objective polynomially. Math. Programm. Stud., 26, 237-239 (1986). 
Muirhead, R.F.: Some methods applicable to identities and inequalities of sym- 
metric algebraic functions of n letters. Proc. Edinburgh Math. Soc., 21, 144-157 
(1903). 

Onn, S., Vallejo, E.: Permutohedra and minimal matrices. Lin. Algebra Appl., 
412, 471-489 (2006). 

Rado, R.: An inequality. J. London Math. Soc., 27, 1-6 (1952). 

Ryser, H.J.: Combinatorial properties of matrices of zeros and ones. Canad. J. 
Math., 9, 371-377 (1957). 

Ryser, H.: Combinatorial Mathematics, Mathematical Association of America, 
Washington, DC (1963). 

Snapper, E.: Group characters and nonnegative integral matrices. J. Algebra, 
19, 520-535 (1971). 


112 E. Vallejo 


28. Torres-Chazaro, A., Vallejo, E.: Sets of uniqueness and minimal matrices. J. 
Algebra, 208, 444—451 (1998). 

29. Vallejo, E.: Reductions of additive sets, sets of uniqueness and pyramids. Discr. 
Math., 173, 257—267 (1997). 

30. Vallejo, E.: Plane partitions and characters of the symmetric group. J. Algebraic 
Comb., 11, 79-88 (2000). 

31. Vallejo, E.: The classification of minimal matrices of size 2 x q. Lin. Algebra 
Appl., 340, 169-181 (2002). 

32. Vallejo, E.: A characterization of additive sets. Discr. Math., 259, 201-210 
(2002). 

33. Vallejo, E.: Minimal matrices and discrete tomography. Electr. Notes Discr. 
Math., 20, 113-132 (2005). 

34. Yemelichev, V.A., Kovalev, M.M., Kravtsov, M.K.: Polytopes, Graphs and Op- 
timisation. Cambridge University Press, Cambridge, UK (1984). 





6 


Constructing (0, 1)-Matrices 
with Given Line Sums and Certain Fixed Zeros 


R.A. Brualdi and G. Dahl 


Summary. Consider the class Ap(R,S) of (0,1)-matrices with row sum vector 
R, column sum vector S, and zeros in all positions outside a certain set P. It is 
assumed that P satisfies a certain monotonicity property. We show the existence of 
a canonical matrix in this matrix class and give a simple algorithm for finding this 
matrix. Moreover, a classical interchange result of Ryser is generalized to the class 
Ap(R,S) and the uniqueness question for the class Ap(R, S) is discussed. 


6.1 Introduction 


Let m and m be positive integers and let R = (ri,r2,...,r,4,) and S = 
(51,52; ..., 54) be nonnegative integral vectors satisfying 577^, ri = $5.4 $j- 


We consider the class of (0, 1)-matrices with row sum vector R and column 
sum vector S, where ones are permitted only in a set P of positions. More 
precisely, let Ap(R, S) be the set of all (0, 1)-matrices A = [a;;] of size m x n 
that satisfy 

$n (<m), 

via Wj = 8) GSN), 
and 

Qij = 0 forall (i, j) g Px (6.1) 
Define the row index sets Pj; = {i < m | (i,j) € P} (j € n). We assume 
throughout the paper that the sets P; (j < n) satisfy the following condition: 

for each i < j either P; 2 Pj or P,P; - 0. (6.2) 


Thus, if a one is permitted in position (i,j), i.e., i € Pj, then it is also 
permitted to place a one in all those positions (i,k) with k « j for which 
P Pj # 0. The following matrix illustrates a possible pattern satisfying 
(6.2), where m = 7, n = 13, and positions in P are indicated by ‘x’: 
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*0**00%* «0«0x 
*0**00%«*« «0«0x 
*0x*x00»x*0*00 
*0*x*x0000000»0]|. (6.3) 
*x0xx0000000*0 
*xx000x0003*000 
*k000*000x000 


We give some remarks related to assumption (6.2) on the sets Pj (j < n). 


(a) The following more general condition may be reduced to (6.2) after suit- 
able reordering of the Pj's: P; O Pj 2 0 or P; € P; or P; € B. 

(b) It may also be of interest to consider matrices where certain entries are 
fixed to either zero or one. By replacing the fixed ones by zeros and updat- 
ing row and column sums accordingly, one obtains an equivalent situation 
where certain entries are fixed to zero only. Thus, one obtains the situa- 
tion treated in this paper provided that the remaining positions P; satisfy 
condition (6.2). 

(c) An interesting special case is when P is specified by a vector K — 
(kı, ko,..., kn) satisfying kı > ka >--- > kn and for each j < n 


P; = (,2,..., kj). (6.4) 


In this special case the position set P forms a Young diagram, so we 
say that P is a Young-pattern given by K = (ki, k2,..., kn). Note that 
if K = (m,m,...,m), that is, P = {1,2,...,m} x {1,2,...,n}, then 
Ap(R, S) coincides with the well-known matrix class A(R, S) of (0, 1)- 
matrices with row sum vector R and column sum vector S. We refer to 
[5] for a treatment of A(R, S). 


Note also that we do not require R or S to be monotone. Here, and later, 
we call a vector monotone if its components are nonincreasing. 

'The main contribution of this chapter is to generalize some results known 
for A(R,S) to the class Ap(R,S). These results include the existence of 
a canonical matrix in the class having certain properties and a simple ex- 
tension of Ryser's algorithm for finding this canonical matrix. Moreover, an 
interchange result is generalized, and the uniqueness question for the class 
Ap(R, S) is discussed. 

A motivation for this work is in the area of discrete tomography, where 
(0, 1)-matrices with given line sums correspond to binary images with hori- 
zontal and vertical projections. If one knows that the image contains zeros 
in certain positions (satisfying (6.2)), one gets the problem discussed in this 
chapter. Moreover, this work extends some results in our previous paper [6], 
where we focused on the matrix class where P was a Young-pattern given by 
K = (m,m,...,m,k,k,..., k) for some k € m. An introduction to discrete 
tomography can be found in [12]; see also the special issue of Linear Algebra 
and Its Applications [9]. For an extensive survey of the class .A(R, S), see [5]. 
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The classical Gale-Ryser theorem says that the class A(R, S) is nonempty if, 
and only if, a simple majorization condition on R and S holds (S is majorized 
by the conjugate of R), see, e.g., the mentioned survey paper or the book [7]. 

Our work is related to paper [8], where Chen considered the class of inte- 
gral matrices with given row and column sums and where each entry lies be- 
tween a given lower and upper bound. This class includes our class Ap(R, S). 
The paper [8] contains an existence characterization of majorization type and 
studied an associated structure matrix. However, these results were shown 
under certain assumptions on the given vectors R and S and on the set P 
that do not include the situation considered in this chapter. A similar char- 
acterization in a restricted situation are found in [15]. In [3] a theorem was 
proved for a nonempty class A(R, S), where R and S are monotone and of 
the same size n. The theorem gives a simple necessary and sufficient condition 
for the class to contain a triangular matrix; in our notation this corresponds 
to a Young-pattern with K = (n,n — 1,...,1). This condition is simply that 
ri <n—i+l and s; <n—i+1 (1 <i € n). Moreover, in [4] Anstee gen- 
eralized the Gale-Ryser theorem to the situation where certain entries were 
fixed to 1 (but at most one such fixed 1 in each column). A related result was 
established in [11], where Fulkerson gave a characterization of the existence of 
(0, 1)-matrices with given line sums and zero trace. More recently Kuba ([13]) 
discussed when A(R, S) contains a unique matrix with certain fixed zeros and 
ones. He characterized this situation in terms of certain sequences and also 
considered a reconstruction algorithm. Finally, we mention that the problem 
of constructing a matrix in A(R, S) with fixed ones and zeros in any set of 
positions may be solved efficiently as a maximum flow problem in a certain 
capacitated digraph (see the discussion of the feasible network flow problem 
in, e.g., [2] or [7]). Still, it is of interest to find theoretical results and simpler 
algorithms for more restricted situations, and the present chapter has this 
focus. 

In this chapter, by the class of a matrix A we mean the class of (0, 1)- 
matrices of the same size as A and having the same row and column sum 
vectors as A. For a matrix A and index sets I and J, we let A(T, J) denote 
the submatrix of A consisting of the entries in rows indexed by J and columns 
indexed by J. Similarly, A(:, J) is the submatrix of A containing (all rows 
and) columns indexed by J, and A(:, j) is the jth column of A. 


6.2 The Canonical Matrix 


For the class A(R, S) the well-known Ryser's algorithm, introduced in [16], 
may be used to find a matrix in that class. The algorithm is simple (and 
efficient), and it actually constructs a canonical matrix in the class having 
certain properties; see [5]. We now follow a similar pattern for the more general 
class Ap(R,S) by performing the following steps: 


1. Show the existence of a canonical matrix À in the class Ap(R, S). 
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2. Use the special properties of this canonical matrix A to set up an algorithm 
for finding A column by column. 


Before we start we remark, one may also use general network flow algo- 
rithms (see [2]) for finding a matrix in Ap(R,S), but our algorithm is much 
simpler (and faster). 

As for the basic class A(R, S), the notion of an interchange is a central in- 
gredient in our approach. An interchange applied to a (0, 1)-matrix A replaces 
a submatrix of order 2 of one of the forms 


T = la | use h J (6.5) 


by the other. 

Ryser proved that given two matrices A and B in A(R,S), A can be 
transformed into B by a sequence of interchanges. Let A € Ap(R,S) and 
assume that the 2 x 2 submatrix A[{i, j}, {k,1}], where i < j and k < l, is 
equal to Tı. This implies that i € P; and j € Pj. Assume, furthermore, that 
i € B. Then PNO P, Z 0 (asi € POP), so by assumption (6.2) we must have 
P, C Py. Therefore, the matrix obtained from A by replacing the mentioned 
submatrix by T> will also lie in Ap(R,S). This operation will be called a 
T,T2-interchange. The operation T5T1-interchange is defined similarly. 


Lemma 1. Assume that .Ap(R, S) is nonempty. Then Ap(R, S) contains a 
matrix A where the 1’s in column n appear in those rows in P, in which r; is 
largest, giving preference to the bottommost positions in case of ties. 


Proof. Let A € Ap(R,S). Assume that i,k € P, are such that ri > fk, 
Qin = 0, and aj, = 1. Then (as r; > rg) there must be a j < n such that 
aij = 1 and ag; = 0. We then apply a (T1T5- or T5Ti-) interchange to A 
(involving rows i, k and columns j, n) and thereby get a matrix that also lies 
in Ap(R, S) as i,k € P, C Pj. Note that here we used the chain assumption 
(6.2). Repeating this operation results in a matrix A! having the 1’s in column 
n in those rows in P, in which A has the largest row sums. Assume now that 
i,k € P, are such that i < k, ri = ry, Gin = 1, and akn = 0. Then there must 
be a j < n such that aj; = 0 and ag; = 1 so we apply an T2T3-interchange 
involving rows i,k and columns j,n. The new matrix lies in .Ap(R, S) as 
i,k € P, C Pj. By repeating this operation, we obtain a matrix A with the 
desired properties. 














The matrix A described in Lemma 1 may not be unique. But the last 
column is unique, and it is found directly from Pa, R, and sn. We call any 
such matrix A a last-column canonical matrix in the class Ap(R, S). Note 
that all matrices obtained from a last-column canonical matrix by deleting its 
last column must belong to the same matrix class (as the last column of Ais 
fixed). 

We now apply the result in Lemma 1 recursively. 
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Theorem 1. Assume that Ap(R, S) is nonempty. Then Ap(R, S) contains a 
unique matriz A such that for each k < n the submatrix Ak consisting of the 
first k columns of A is a last-column canonical matrix in its class. 

Proof. We shall construct A column | by column. We may, using Lemma 1, 
find a last-column canonical matrix A in Ap (R, S) and set A(:,n) = A(:,n). 
Consider next the matrix A' — AC: :, (1,2,...,n—1]) and choose a last-column 
canonical matrix B in the class of A’. Let column n—1 of A be equal to the last 
column of B. Proceed like this and construct columns n — 2,...,1 of A. Then 
A is the unique matrix with the desired properties stated in the theorem. 














The matrix A described in Theorem 1 is called the canonical matrix in 
the class Ap(R,S). In the next section we discuss an algorithm for finding A. 


6.3 The Generalized Ryser Algorithm 


'The proof of Theorem 1 is constructive and it actually contains an algorithm 
for finding the canonical matrix. We call this procedure the generalized Ryser 
algorithm as it specializes into Ryser’s algorithm when P = (1,2,...,m] x 
1. 2 ccn. 


Generalized Ryser Algorithm 


1. (Initialize) Let k = n and let R= R. 

2. (Determine column k) Find the indices im Py corresponding to the sy 
largest positive components of 1 R, where we prefer largest indices in case 
of ties. Let the kth column of A have ones in the sy positions just found. 

3. (Update row sum) Let R = R — A(:,k) (the row sum vector after the last 
column has been deleted). If k > 1, reduce k by 1 and go to step 2. 


Due to Theorem 1, this algorithm will find the canonical matrix A provided 
the class is nonempty. If the class is empty, the procedure will break down at 
some stage. We see that this happens if, in an iteration k, the present row 
sum vector R has fewer than s; ones in the positions corresponding to PX. 
This may be seen as an algorithmic characterization of the nonemptyness of 
Ap(R, S). 

The algorithm may alternatively be presented as follows. We start with 
the matrix A, which is the (0, 1)-matrix of size m x n whose ith row consists 
of r; leading ones followed by n — r; zeros for à < m. Then we shift ones to the 
right as in Ryser's algorithm, except that we use only the feasible positions 
Py, in each column k. Thus, the procedure is 
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Start with A — A and construct column n by shifting ones to the right 
so that ones occur in those rows of Pa, where A has largest row sums, 
using the bottommost position in case of ties. Proceed to construct 
column k similarly by shifting in rows in Py fork 2 n — 1,...,1. 


If the procedure breaks down at some point, the only reason can be that 
the matrix class Ap(R, S) is empty. 


Example 1. Let m — 5, n = 8, R = (5,5,4,4,3), S = (3,3,3,3,3,2,2,2), and 
let P correspond to the Young-pattern given by K — (5,5,5,5,4,4,3,3). The 
generalized Ryser algorithm proceeds as follows (where the forced zeros are 
shown in boldface): 


11111000 11110001 11110001 11100101 
11111000 11110001 11100011 11100011 
11110000|—1|11110000|—/|11100010|—— 11100010) — 
11110000 11110000 11110000 11100100 
11100000 11100000 11100000 11100000 
11100101 11010101 11010101 11010101 
11001011 11001011 10101011 10101011 
11001010|,—/|11001010|—/|10101010|—/|10101010 
11001100 10011100 10011100 01011100 
11100000 11010000 10110000 01110000 


6.4 Interchanges 


A central result concerning the matrix class A(R, S) was proved by Ryser in 
[16]. It says that one can transform any given matrix in A(R, S) into another 
given matrix using a sequence of interchanges. It is natural to ask if a similar 
property holds for the more general class Ap(R, S), and the following theorem 
answers this question positively. 


Theorem 2. Let A and B be two given matrices in Ap(R,S). Then there is 
a sequence of interchanges that transforms A to B with every intermediate 


matriz in .Ap(R, S). 


Proof. The proofs in our Lemma 1 and Theorem 1 show that any matrix in 
the class Ap(R, S) may be transformed using interchanges into the canonical 
matrix A in that class, and all intermediate matrices belong to the class. 
Therefore, both A and B may be transformed into A, so by reversing the latter 
interchanges we get the desired sequence of interchanges that transforms A 
into B. Note here that all the interchanges involved here are “feasible,” i.e., 
they don't put ones in forbidden positions. 














An interesting question is to find a more *direct" proof of this theorem 
which finds suitable interchanges without going via the canonical matrix A. 
In the case of A(R, S), such an argument is known; actually the original proof 
of Ryser was along these lines. 
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6.5 Uniqueness and a Reconstruction Algorithm 


From the point of view of discrete tomography, it is of particular interest 
to know when .Ap(R, S) contains a unique matrix A = [a;;]. Such a matrix 
A is then uniquely reconstructible given its row and column sum vectors R 
and S and its prescribed zeros as specified by the pattern P. Unique recon- 
structability of A is equivalent to the uniqueness of a solution of the system 
of equations 


> ay=s; (ISIS), (6.7) 
E aj =Oorl ((üj)€ P), (6.8) 
aj =0 ((íj)€ P). (6.9) 


Such uniqueness questions have been studied in [1] in a more general context 
where certain entries are prescribed to be 0 as in (6.9) and certain other entries 
are prescribed to be 1. Also, uniqueness of particular entries is considered. It is 
to be observed that the coefficient matrix corresponding to the linear system 
(6.6) and (6.7) is totally unimodular (see, e.g., Lemma 3.1 of [1]), and this 
facilitates the analysis. From the general theory developed in [1], there follows 
the uniqueness characterization given in [10] in the case that no entries are 
prescribed. The relevant concept is that of additivity, where an m by n (0, 1)- 
matrix A = [a;j] is additive provided there are real vectors (z1,22,..., Em) 
and (yi, ya, ..., y») such that aij = 1 if, and only if, z; + y; > 0. Uniqueness 
of A is equivalent to additivity of A. 

We first discuss uniqueness for a special Young-pattern and then the same 
question is studied for a general pattern P. We call a (0, 1)-matrix Y a Young 
matrix if each row and column in Y consist of a sequence of 1’s followed by a 
sequence of 0’s, i.e., the ones in Y determine a Young-pattern. 

Let P be a Young-pattern given by K = (kı, k2,...,kn) for which the 
class .Ap(R, S) is nonempty. First assume that K = (m,m,...,m). Then 
Ap(R, S) — A(R, S), and without loss of generality we may assume that R and 
S are monotone nonincreasing. Then it is well known [5] that A(R, S) contains 
a unique matrix if, and only if, S equals the conjugate R* = (rf,r3,..., r7) 
of R, in which case the unique matrix is the Young matrix A with row sum 
vector R* (its ith row consists of r7 ones followed by zeros). 

We now consider the case studied in [6], where K = (m,m,...,m,k,k,..., 
k) (p m’s) for some positive integer k < m and some positive integer 
p < n. Thus, every matrix in Ap(R, S) has an (m — k)-by-(n — p) zero 
matrix in its lower right corner. Without loss of generality we may as- 
sume that R! = (ri,...,ry), R” = (rgji,..., T), S’ = (51,...,55), and 
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S" = (Sp41,---, $n) are monotone nonincreasing. The following lemma ex- 
plains, whenever Ap(R,S) contains a unique matrix, the structure of this 
matrix. 


Lemma 2. Assume that Ap(R, S) contains a unique matrix 


_ |442 
inja, T 


where A; is a k by p matrix. Then A1, A», and Aa are all Young matrices. 


Proof. From the uniqueness assumption, it follows that the submatrix A' — 
[ A; A2] is unique in its class. Let B’ be obtained from A’ by permuting 
columns so that B’ has nonincreasing column sums. Since B’ has monotone 
row and column sum vectors (the row sum vector of B’ is R’) and B’ is unique 
in its class (as A’ is), it follows that B’ must be a Young matrix. Therefore, 
A’ is obtained from a Young matrix by permuting columns and, moreover, 
its submatrix Ag is itself a Young matrix. The last property follows from the 
fact that the column sum of A» is S", which is monotone. A similar argument 
proves that A3 is a Young matrix and that A; must be obtainable from a 
Young matrix by permuting its rows. Finally, we note that the properties 
established for A; imply that A; must be a Young matrix. 














Consider a matrix A as in Lemma 2, where A1, Ag, and A3 are Young 
matrices. Assume that A has no row or column that contains only ones or 
only zeros. Then the last column of A, must be the zero vector and the entry 
in the upper right corner of A3 must be a 1. Therefore, the first row of A3 is 
an all-ones vector. Similarly, the first column of A2 contains only ones. Thus, 
the following recursive procedure determines if a given matrix A € A(R,S) 
is unique: Delete an all-zero or all-ones row or column, or delete a row where 
the corresponding row in A3 contains only ones, or delete a column where 
the corresponding column in A» contains only ones. The initial matrix A is 
unique if, and only if, we end up with the empty matrix in this procedure. 

We now generalize these ideas and discuss the uniqueness question for a 
general pattern P satisfying (6.2). To this end it is useful to discuss the prob- 
lem in terms of certain graphs. To each matrix A € Ap(R, S), we associate 
a directed graph GA constructed as follows (a similar construction is used 
n [17]). Ga contains vertices u1, U2,..., Um and v1, v2,..-,Un- Moreover, for 
each (i,j) € P, there is an arc in Gy: This arc is (uj, vj) if aj; = 1 and it 
is (vj, u;) if aj; = 0. Note that G4 represents A uniquely (different matrices 
correspond to different such graphs). 


Theorem 3. Let A € Ap(R, S). Then the following statements are equiva- 
lent: 

(i) A is the unique matriz in Ap(R, S). 

(ti) Ga is acyclic. 

(iii) Ga contains no directed cycle of length 4. 
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(iv) Starting with A and recursively deleting rows and columns of all 0’s in 
positions of P, or rows and columns of all 1’s in positions in P, one eventually 
obtains the empty matrix. 


Proof. A directed cycle in G4 corresponds to an even sequence of positions 
in P 

(i1, j1), (i2, j1), (i2, j2), (ia, j2), S (it, jt), (i1, Jt) (6.11) 
such that the corresponding values a;j is an alternating sequence of 0’s and 
l's. By interchanging 0’s and 1’s in this sequence, we obtain another matrix 
A' € Ap(R, S) (as line sums are preserved and no positions lie outside P). 
In GA this operation corresponds to reversing the direction on all arcs in 
the cycle. From this it is clear that (i) implies (ii). The converse implication 
also follows as if A, A’ are two matrices in .Ap(R, S), then the positions where 
aij F ai may be partitioned into disjoint sets corresponding to cycles as above 
(see [5] for details). Note that all these positions lie inside P as aij = aj; = 0 
for all (i, j) € P. This proves that statements (i) and (ii) are equivalent. 

The equivalence of statements (i) and (iii) follows from Theorem 2 as 
going from a matrix A to another matrix A’ via an interchange corresponds 
to reversing the direction of a directed cycle of length 4 in the graph G4. 

Finally, statement (iv) is equivalent to statement (ii) since the recursive 
procedure is an algorithm for testing if the directed graph GA is acyclic. It 
deletes vertices with indegree or outdegree zero as such vertices cannot belong 
to any cycle. If, eventually, all vertices are deleted, then there is no cycle. 
Otherwise, one finds a subgraph where each vertex has an ingoing arc and an 
outgoing arc, and this subgraph contains a directed cycle. 














We remark that in this theorem the equivalence of statements (i), (ii), 
and (iv) holds for any pattern P C (1,2,..., m] x {1,2,...,n}, while the 
equivalence to statement (ii) depends on our standard assumption (6.2) on P. 

From Theorem 3 we may derive a simple reconstruction algorithm of in- 
terest in discrete tomography. 

Reconstruction Algorithm 


1. Initialize A by setting aj; = 0 for all (i, j) ¢ P. 

2. Find a line in the matrix such that its line sum, together with the informa- 
tion provided by P and the previously determined entries in the algorithm, 
forces all remaining entries to be all one or all zero. 

3. Set these entries accordingly, and return to step 2 until A is completely 
specified. 


'There are two possible outcomes of the algorithm: Either all entries are 
determined and then the constructed matrix is unique in the class, or the 
algorithm stops prematurely because of a “nonunique” line and then either 
the class is empty or it contains at least two matrices. 


Corollary 1. Assume that .Ap(R, S) is nonempty. Then the reconstruction 
algorithm determines if the class Ap(R,S) contains a unique matriz and, if 
so, it finds this unique matriz. 
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Proof. The correctness of the algorithm follows from statement (iv) in Theo- 
rem 3. The algorithm constructs the matrix in the reverse order compared to 
the deletion process described in statement (iv). 














In the reconstruction setting of images in discrete tomography, an image 
is known to exist, and then the algorithm may be used for testing if the image 
is unique and finding this image if uniqueness holds. 

As a concluding remark, we believe that it may be difficult to find sim- 
ple conditions on R, S, and P that characterize the uniqueness in the class 
Ap(R, S). It would be of interest to find such a characterization although 
the simple algorithm above provides an algorithmic answer to this uniqueness 
question. 


6.6 Conclusions and Summary 


We have shown that some of the classical results known for A(R, S) generalize 
to the class Ap(R, S): (i) the existence of a canonical matrix; (ii) a simple 
algorithm for finding a matrix in the class; and (iii) the possibility of going be- 
tween any pair of matrices in the class by using a sequence of interchanges. It 
is interesting that all these results hold although a nice and simple majoriza- 
tion characterization of the nonemptyness of Ap(R, S) is not present. Finally, 
we have given a simple reconstruction algorithm for the class Ap(R, S). 
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Reconstruction of Binary Matrices under 
Adjacency Constraints 


S. Brunetti, M.C. Costa, A. Frosini, F. Jarray, and C. Picouleau 


Summary. We are concerned with binary matrix reconstruction from their orthog- 
onal projections. To the basic problem we add new kinds of constraints. In the first 
problems we study the ones of the matrix must be isolated: All the neighbors of a one 
must be a zero. Several types of neighborhoods are studied. In our second problem, 
every one has to be horizontally not isolated. Moreover, the number of successive 
zeros in a horizontal rank must be bounded by a fixed parameter. Complexity results 
and polynomial-time algorithms are given. 


7.1 Introduction 


Given a binary matrix, its horizontal and vertical projections are defined as 
the sum of its elements for each row and each column, respectively. The recon- 
struction of a binary matrix from its orthogonal projections has been studied 
by Ryser [22, 23]. One can refer to the book by Herman and Kuba [17] for 
further information on the theory, algorithms, and applications of this classi- 
cal problem in discrete tomography. It is well-known that this basic problem, 
where the only constraints to verify are both projections, can be solved in 
polynomial time. Numerous studies deal with this problem when additional 
constraints have to be taken into account. Some examples of such constraints 
are different kind of convexity [6, 12, 16, 20], connectedness [4, 7, 24], pe- 
riodicity [15, 19], and Gibbs priors [10]. In this chapter we study two other 
constraints for this problem. In a first part we take into account the “non- 
adjacency” constraint that depends on the definition of neighborhood: If a 
cell value is 1, then the values of each one of its neighbors must be 0. This 
problem arises especially in statistical physics to determine the microscopic 
properties (energy, density, entropy). In the model of hard square gas, two 
adjacent cells cannot be occupied simultaneously by particles because the en- 
ergy of repulsion between them is very high (see, for instance, [5] or [9]). In 
this part several kinds of neighborhoods will be studied. 

In the second part, we are concerned with a different constraint that im- 
poses that in every row of the binary matrix no isolated cell with value 1 is 
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permitted, and the maximum number of consecutive cells of value 0 is not 
greater than a prescribed integer k. Taking into account this constraint, the 
reconstruction of binary matrices has natural applications in timetabling and 
workforce scheduling. Indeed, given a company with a certain number of em- 
ployees and a planning horizon of a certain number of days, 0’s correspond to 
the working days of the employees and 1's corresponds to their days-off; the 
horizontal projections represent the total amount of days-off for the employ- 
ees in the considered period, and the vertical projections give the number of 
employees at rest during the days. Hence, the social constraints state that a 
day-off cannot be isolated and an employee cannot have more than k succes- 
sive working days. The reader can see, for instance, [2, 3, 8, 18] for some other 
days-off assignment problems. 


7.2 Preliminaries 


In this section we introduce notations and definitions that we use in the fol- 
lowing sections. 

Given an m x n binary matrix A, we denote its horizontal projection 
by H = (hi,...,hm), hi being the number of 1’s in row i, and its vertical 
projection by V = (vi,..., Un), vj being the number of 1’s in column j (see 
Fig. 7.1 for an example). The condition 5 7^ , hi = 355 , vj, denoted by (Co), 
is obviously necessary for the existence of a binary matrix respecting both 
projections in the problems we study in this chapter. Specially, we will consider 
the reconstruction problem from the horizontal and vertical projections for two 
classes of binary matrices obtained by imposing the nonadjacency constraint, 
and the timetabling constraint. 








Fig. 7.1. A binary matrix with projections H — (3,4,2, 3) and V — (2,4,3,2, 1). 


In order to define the nonadjacency constraint we recall several definitions 
of the neighborhood of a given cell a;,; of the matrix A. We speak of 2- 
adjacency if the two neighbors of a; j are ai j+1 and ai j—1, i.e., its horizontal 
adjacent cells; 4-adjacency if the neighbors are a;,;—1,@i,j41,@i-1,;, and aj41,;, 
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i.e., the horizontal and vertical adjacent cells; 6-adjacency if, in addition to the 
previous ones, we add the neighbors on one diagonal: a;_1,;-1 and ai+1,j+1- 
Eventually, in the S-adjacency the neighbors of a; j are its eight adjacent cells 
Qj,j—1; Qi, 54-1; Qi—1,5; Qi 1,5; Qi—1,5—1» Qi—1,j+1,; Gi+1,j-1, and Qj--1,34-1- We say 
that matrix A fulfills the non p-adjacency constraint if a;,; = 1 implies that 
its p-adjacent cells have value 0. Figure 7.2 illustrates the non p-adjacency for 
p =2,p=4,p = 6, and p = 8 (from the left to the right). The class of binary 
matrices that fulfill non p-adjacency is denoted by Np, p € (2,4,6,8]. 
































0 0|0 0|0 
0|1/|0 0 0 0 0 1|0 
0 010 0|0/|0 

















Fig. 7.2. Examples of the non (2, 4,6, 8}-adjacency (from the left to the right). 


We say that matrix A fulfills the timetabling constraint if a;,; = 1 implies 
àjj-1 = 1 or aij+1 = 1, and the number of successive 0’s in each row of 
A is less than or equal to a given integer k > 2. Figure 7.3 shows a matrix 
that fulfills the timetabling constraint for k = 3. Given k, the class of binary 
matrices that fulfills timetabling constraint is denoted by 7;. 


ESESESICIEIHEIESESESESEHR 


BRR aes 
ENQUERRE 


Fig. 7.3. A matrix of 73. 





7.3 Reconstructing Matrices under Nonadjacency 
Constraints 


Let us consider the problem of reconstructing an m x m binary matrix of 
Np from its horizontal H = (hi,...,h,,) and vertical V = (v1,...,Un) pro- 
jections, where V; is the class of matrices that fulfill non p-adjacency con- 
straints. We denote this problem by Reconstruction, (m, n). In this section 
we deal with this problem and its modifications. In particular, we show how 
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to solve Reconstructiony, (m, n) in polynomial time. In Subsection 7.3.2, 
conditions for the existence of a solution to Reconstructiony, (m, n) are 
given, and we solve it for matrices with two or three rows (m — 2,3). Fi- 
nally, we prove that Reconstructiony,(m,n) is NP-complete, and that 
Reconstruction, (m, n) is as difficult as the well-known open problem of 
reconstructing a three-colored matrix [13]. 


7.3.1 Binary Matrices under Non 2-Adjacency Constraint 


We show that Reconstruction, (m, n), the problem of reconstructing a 
binary matrix under non 2-adjacency constraint from orthogonal projections, 
is equivalent to a particular packing reconstruction problem that can be solved 
in polynomial time. 

In particular, we consider the horizontal domino packing problem. In this 
problem, every tile is a horizontally oriented domino, i.e., an array 1 x 2 or 
equivalently two consecutive cells in a row, and the packing consists in a 
placement of nonoverlapping horizontal oriented dominoes in a m x n array 
satisfying the orthogonal projections, i.e., a given number of distinct tiles in 
each row and column. 


Lemma 1. Reconstruction, (m, n) is equivalent to the horizontal domino 
packing problem. 


Proof. Consider an instance of Reconstruction,,(m,n) defined by H = 
(hi,ho,..., hm) and V = (v,vo,..., Un). Without loss of generality, we can 
suppose that vn = 0. 

'The two problems are equivalent, since it is easy to see that the in- 
stance of the horizontal domino packing problem can be obtained by H, and 
V simply, and vice versa, and a solution for the horizontal domino pack- 
ing problem corresponds to a solution of Reconstructiony, (m, n) replac- 
ing every domino by two cells of values 1 (the leftmost) and 0 (the right- 
most), and vice versa, two consecutive cells of Reeconstructiony, (m, n) 
with values 1 and 0 by a domino. Indeed, denoted by H’,V’, the orthogo- 
nal projections of the horizontal domino packing problem, H — H' and V — 
(vi, vb —vi, v — (v5 —vi), v — (04 — (0h —1)) .. v — aare (0$ vi). 
This latter vector counts the number of horizontal dominoes starting in every 
column [21]. Since two consecutive dominoes in a row cannot start at consec- 
utive columns, the first cells of each domino in a packing satisfying H' and 
V’ also fulfill the nonadjacency constraint and satisfy H and V. 




















Since there exists a polynomial-time algorithm for the horizontal domino 
packing problem [21], we can give the following theorem. 


Theorem 1. Reconstructiony, (m, n) can be solved in polynomial time. 
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7.3.2 Binary Matrices under Non 4-Adjacency Constraint 
Reconstruction from One Projection 


We are going to consider the problem of reconstructing a binary matrix under 
the non 4-adjacency constraint from one projection. Without loss of generality, 
we consider the horizontal projection. To solve Reconstructiony,(m), we 
shall use an “underlying” chessboard C of size m x n to provide forbidden 
positions (i.e., cells that cannot have value 1) and free positions (i.e., cells that 
can have either value 1 or value 0) for A. Notice that a chessboard verifies the 
non 4-adjacency constraint. In the chessboard, the cells c; j such that i+ j is 
even (respectively, odd) have the same value as c1, (respectively, co,1). Thus, 
there are two possible configurations of C depending on the value of c; 1. 
Consider the configurations resulting from c;,; = 1: If n is even, there are 
n/2 1’s in each row, and if n is odd, there are (n + 1)/2 1’s in odd rows and 
(n — 1)/2 1’s in even rows; such a chessboard is the most dense binary matrix 
respecting the non 4-adjacency constraint. 

Now, we are going to give some necessary and sufficient conditions on H, 
depending on the parity of n, for the existence of an m x n matrix A € M4 
with horizontal projection H. We call Consistency y, (m) the corresponding 
decision problem. As a byproduct, we shall get a method to build A if it exists. 

First, note that h; 1’s are required in each row i, and so at least h; — 1 0’s: 
Thus, there is a solution only if, for all i, hi + hi — 1 € n or equivalently, for 
all i, hi € (n + 1)/2. This condition, denoted by (C1), is necessary whatever 
is the parity of n. 


Proposition 1. If n is even, then hy € (n+1)/2, for i = 1,...,m, isa 
necessary and sufficient condition for Consistency y, (m). 


Proof. The necessity has been proved above. Let us prove the sufficiency. In 
each row i, n being even and h; being integer, h; < (n+1)/2 sh; € n/2. 
Consider a chessboard: It has 7 1’s per row; by choosing exactly h; l's per 
row and setting the other cells to 0, we get the solution. 











Nevertheless, this condition is not sufficient in the case where n is odd: Just 
consider two adjacent rows with hi = h;,1 = (n + 1)/2; the condition (C4) 
holds but there are 1’s in vertically adjacent cells and the non 4-adjacency 
constraint is violated. When n is odd, we need to introduce more definitions. A 
row i is said to be saturated if h; = (n+ 1)/2; that means that there must be 
a l in each odd column. We said that two saturated rows 7; and iz are consec- 
utive if there is no saturated intermediate row between them (Vi à < i < i2, 
row i is not saturated) (see Fig. 7.4). Let us denote by A;,;, the submatrix of A 
restricted to rows i such that i; < i < i2, and by Consistency y, ([i1, i2]) the 
corresponding subproblem. Obviously, Consistency y, (m) is satisfied only 
if for any consecutive saturated rows i; and iz, Consistency y, ([i1,i2]) is 
satisfied. 
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Fig. 7.4. A matrix of N4 with n odd. 


Note that, as seen above, if two adjacent rows are saturated, then A cannot 
exist. Thus, from now we assume that 49 > 7, + 1. 

Let k = i—i — 1 > 1 be the number of intermediate rows in 
Aiia. The next three propositions give conditions on h;'s that ensure that 
Consistency p, ([i1,i2]) is true. These propositions are based on the par- 
ity of k. For an intermediate (unsaturated) row i, n being odd, we have 


Proposition 2. If n is odd and k is odd, then a necessary and sufficient 
condition for Consistency y, ([i1,12]) is that hy < (n — 1)/2, for all i such 
that iq <i < i2,. 


Proof. Just recall that a chessboard in Aj,;, has at least (n — 1)/2 1’s per row 
i (but no more than (n + 1)/2), à € i € i2. o 
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When n is odd and k is even, it is not possible to cover Aj,;, with a 
chessboard (see Fig. 7.4). In a first step we give a necessary condition for the 


existence of A. We call it (C2); it is that p» dm hy € kn/2 — (n + 1)/2. 


Proposition 3. If n is odd and k is even, then (C2) is a necessary condition 
for Consistency y, ([i1, i2]). 


Proof. Rows i; and iz are saturated. In the set of intermediate rows i, 41 < 
i < iz, there are at most k/2 1’s in an even column and &/2 — 1 Is in an odd 
column. n being odd, there are (n — 1)/2 even columns and (n+ 1)/2 odd 
columns. Hence the total number of 1's in the intermediate rows is at most 
(n — 1)/2 x k/2 4- (n+ 1)/2 x (k/2 — 1) = kn/2 — (n + 1)/2; thus, (C2) must 
be satisfied. 














Now, let us show that when n is odd and k is even, (C1) and (C2) are 
sufficient for a solution to exist. Let f be the following integer function defined 
from {i = i1,..., i2 — 1} to [L,..., n] by 
(a) fla) — 1, 

(b) f( +1) 2 n— 2hi,41, 
(c) f(i) = min(n, f(i — 1) - n — 2h,), for i = i1 +2,...,42 — 1. 

Without loss of generality, we assume now that i; is even. (If ių is odd, 
just exchange in the following on the one hand “odd” and “even” and on the 


other hand “have the same parity" and “have not the same parity." ) 
Let ? be the smallest i such that f(î) = n. 


Property 1. ? is such that $ < tg. 


Proof. Assume that ? > i», that is, f(¢2 — 1) < n. By induction, f(i9 — 1) = 


f(i 2) n 2h, = + = Diin n- 2hikn— 2 2G hi Recall that 
(C2) is RM, hi € kn/2 — (n + 1)/2; hence, we get n -- 1 € f(ig — 1) <n, 











a contradiction. 





Property 2. f(t; +1) > f(t) and f is increasing on {i1 + 1,..., i]. 


Proof. For ij < i < i2, we have hy € (n — 1)/2, i.e., n — 2h; > 1. Then 
for i =7,41,...,2. 














Property 3. For each 4 such that i; < i < î, i and f(i) have the same parity. 


Proof. Recall that 41 is even. Since n is odd, f(i1 +1) is odd and the relation 
is satisfied for i = 41 + 1. For à > i1 4-1, f(i) =f(i— 1) 4- n — 2h;, then by 
induction i and f(i) have the same parity. 
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Now, set to 0 all the cells in ((4, j) | i = 414-1,...,5 j = f(i-1),..., fF} 
That defines “stairs” from column 1 to column n, splitting up the submatrix 
Aiia into two parts separated by these 0’s (see Fig. 7.4). Let us cover the 
upper right part by a partial chessboard with c;,ı = 1, and the left lower 
part by another partial chessboard with c;,;, = 1. 21 is even, à = $1 +k+1, 
and k is even, thus i9 is odd: The upper right partial chessboard has 1’s in 
the cells (i, j) such that i+ j is odd, whereas the lower left partial chessboard 
has 1’s in the cells such that i+ j is even. Clearly, the obtained binary matrix 
satisfies the non 4-adjacency constraint. 


Property 4. For à, à < i < 1, there are exactly h; 1’s in each row i. 


Proof. Let us consider a row ? with i; < à < i. There is a stair of cells with 0 
from column f (i — 1) to column f(i). The cells (i, 1), (é,2),...,(¢, f(—1) — 1) 
are on the left of the stairs and belong to the lower left partial chessboard; 
note that for i = 4 + 1, there are no cells on the left side. The cells (i, f (i) + 
1), (i, f (à) + 2),..., (à, m) are on the right of the stair and belong to the upper 
right partial chessboard. The cell (i, f(¢ — 1) — 1) is adjacent to the stair by 
the left and from Property 3, i+ f(i — 1) — 1 is even and the cell (i, f (4) + 1) 
is adjacent to the stair by the right and from Property 3, i + f(z) + 1 is odd: 
Thus, both cells have a 1. There are [(f(i — 1) — 1)/2] 1’s on the left of the 
stair, and [(n — f(i))/2] 1’s on the right of the stair. Thus, the total number 
of I's in row 7 is [(f(i — 1) — 1)/2] + [(m — f(@))/2] 2[(f(i — 1) — 1)/2] + 
[QA — fG- 1/21 = hi [UG — 1) 1)/2 + [-fG — 1/2] = hs. 


Property 5. There are at least h; 1’s in each row 7,7 < i € i9 — 1. 




















Proof. First, consider the row 7. Since ? and f(i — 1) — 1 have the same parity, 
there is a 1 in the cell (7, f(@ — 1) — 1). So there are [(f(? — 1) — 1)/2] 1’s on 
the left side of the stair in row 7. We have f(?i) 2 n € f($ — 1) 4- n — 2h; 
thus, 2h; € f(6—1) > h; € |f($ — 1)/2]2[(f($ — 1) — 1)/2]. Second, in each 
row i, 7 < i € i2 — 1, there are at least (n — 1)/2 1’s. These rows being not 
saturated, we have h; € (n — 1)/2. 














We are ready to establish the following proposition: 


Proposition 4. If n is odd and k is even, then Consistency y, ([i1,i2]) is 
satisfied if, and only if, (C1) and (C2) hold. 


Proof. (C4) and (C2) are necessary conditions. Now, if (C1) and (C2) are 
satisfied, to get a solution to Reconstructiony, ([i1,i2]), we have to build 
the stairs of 0’s as explained above. Then, from Property 4 and Property 5, 
each row i, i1 < i < i2, has at least h; 1’s; one can select exactly h; 1’s to get 
a solution to Reconstructiony, ([i1, i2]). 














Now let the row r be the first saturated row of A and, if r Z 1, consider 
the subproblem Reconstructiony, ([1, r]) corresponding to the r first rows: 
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It has a solution if, and only if, the condition (C1) is satisfied for i = 1,...,r. 
'The proof is the same as the proof given for Propositions 1 and 2. In addition, 
the subproblem Reconstructiony, (|[s, m]), where s < m is the last saturated 
row, can be solved as well. 

Eventually, when n is odd, to solve Reconstructiony,(m) we have to 
solve each subproblem Consistency y, ([i1,i2]), where i; and i? are two con- 
secutive saturated rows. If all these subproblems are satisfiable, putting to- 
gether the solutions of the corresponding Reconstruction, ([i1, 12]) provides 
a solution to Reconstruction, (m); otherwise, Consistency y, (m) is not 
satishable. Now, we obtain the following result: 


Theorem 2. Reconstructiony,(m) has a solution if, and only if, (C1) 
holds and either n is even or n is odd and (C2) holds for each pair (i1, i2) 
of consecutive saturated rows such that ig — i, — 1 is even. 


Note that if Reconstruction, (m) has a solution, then it can be obtained 
in a linear time using chessboards as indicated in the proofs above. 

All the results obtained in this section can be settled symmetrically for 
the problem Reconstructiony, (n), where instead of a vector H of horizontal 
projections, we are given a vector V of vertical projections. 


Reconstruction Satisfying the Orthogonal Projections 


It seems that the complexity of Reconstructiony,(m,n) is still an open 
problem. We first propose a sufficient and then a necessary condition for 
the existence of a solution to Reconstructiony,(m,n), then we solve this 
problem in the special cases m = 2 and m = 3. 


Sufficient Conditions for the Existence of a Solution to 
Reconstructiony, (m, n) 


Let Ho = (hi, ha, eg hop-1); p = [m/2], and Vo = (t1, Us, was ,U2q—1), 
q = [n/2] (respectively, Hg = (h2,h4,...,h2p), p = |m/2], and Vg = 
(v2, V4, ..., U24), q = |n/2]), be the projections of the odd (respectively, even) 
rows and columns. The existence of [m/2] x [n/2] binary matrices Ao € M4 
satisfying Ho and Vo and Ag € V4 satisfying Hg and Vg implies that there 
is a solution to Reconstructiony, (m, n). Indeed, this solution is obtained 
by aggregating the two binary matrices Ao and Ag: It obviously satisfies the 
projections and the non 4-adjacency constraint holds since the 1’s are in cells 
aij Such that i + j is even. 

In the same way, if the two subproblems restricted first to odd rows and 
even columns and second to even rows and odd columns both have a solution 
then Reconstruction, (m, n) has a solution: in that case, the 1’s are in cells 
aij Such that i+ j is odd. 

However, these conditions are not necessary; just consider the instance 
m=3,n=4, H = (2,1,2), and V = (2,1,1,1). 
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Necessary Conditions for the Existence of a Solution to 
Reconstructiony, (m, n) 


Following the definitions of Reconstructiony, (m), Reconstruction, (n), 
and Reconstruction, (m, n), we have the obvious necessary condition: If a 
solution to Reconstructiony,(m, n) exists, then there exists a solution to 
the corresponding instances of Reconstructiony, (m), Reconstructiony, (n), 
and Reconstructiony, (m, n). Nevertheless, these conditions are not suffi- 
cient as shown by the following instance: m — n — 4; H — (2,1,2,1); and 
V = (1,2,2,1). Now, we will focus on two particular cases: m = 2 and m = 3. 


Solving Reconstructiony, (m, n) when m = 2 


In this case vj € {0,1} for all j, for otherwise the non 4-adjacency constraint 
cannot be satisfied. From the columns such that v; = 1, we make K blocks B5, 
where each B* is a maximal set of consecutive columns with unary vertical 
projection (see Fig. 7.5). Let |^ denote the number of columns in B*, k = 
































1,..., K. Since the columns j that are not in a block are such that v; = 0, we 
have that Y55 , I^ = 2 
7/1 0 10 1/0/10 1/0/0 1 0 1 0 
6/10 10 1 0|0/0 1 010/11 O 1 0 1 
12 3 4 5 Ll 2-3 12 3 4 1 
p! B? B p^ 


Fig. 7.5. A matrix of N4 with m = 2. 


We can give the following proposition. 


Proposition 5. There exists a solution to Reconstructiony,(m,n) for 
m — 2 if, and only if: 
(Co) hy + hy = MiiUj 
(C1) hi € (n+ 1)/2, for i = 1,2; 
(C2) v; € {0,1}, fri <j <n; 
(C3) hi Py EE. [1¥/2], for = 1:2; 


Proof. (Co) and (C2) are obviously necessary as explained before. h; < 
(n + 1)/2 is the condition (C1) necessary for Reconstruction, (m): Any so- 
lution to Reconstructiony, (m, n) is a solution to Reconstructiony, (m); 
thus, (C1) is also necessary for Reconstruction y, (m, n). Eventually, for each 
block B; there must be a 1 in each column; to respect the non 4-adjacency 
constraint, there must be at least [i / 2| I's in each row of each block B* and 
(C3) is necessary. 
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Now, assume that the conditions are all satisfied. (C3) implies that 
hi — As [15/2] > 0; the number of blocks of odd length is equal to 
X£,I*-2XE,|[r*/2|2 E? v; - 2Z$ [07/2] = hi + he - 2E, |1I5/2|- 
(hy — XE, |I*/2]) + (ho — XE, |I*/2]) > 0. Then we can obtain a solution 
to Reconstructiony, (m, n) in the following way: 


1. Renumber the columns of each block from 1 to I^, k = 1,..., K; 
2. In the first row: 
a) “select” the first hy — X$ |1*/2| blocks of odd length, then, for these 
selected blocks, [i^ / 2| + 1 cells lying in the odd columns are set to 1. 
b) for the non selected blocks, |/*/2] cells lying in the even columns are 
set to 1. 
3. in the second row: for each block B*, the cells lying in the columns not 
having 1 in their first row (step 2) are set to 1. 


Doing so, we get exactly a 1 in each column of each block, as required, 
and our solution satisfies the adjacency constraint. In addition, in the first 
row we get |15/2] 1’s in each not selected block and |15/2]| +1 1’s in the hy — 
XE 4 [15/2] selected blocks. That gives (U/L, [15/2]) - (hı — XE, |l*/2]) = 
hı l's in the first row. Since the total number of blocks of odd length is 
(hi — SE, |1*/2]) + (ho — EE, |I^/2]), there are ho — EF, |1*/2| odd not 
selected blocks at step 2a. These not selected blocks are treated at step 2b 
and have 1’s in the first row in even columns; from step 3, they get [F / 2| +1 
1’s in the second row in odd columns while the other blocks get Xf , [1% / 2| 
1’s in the second row. Finally, we have XÆ} [1/2] + (ho — VEL, |l*/2]) = he 
1’s in the second row. 














Solving Reconstructiony,(m,n) when m = 3 


For a matrix A € 4 with three rows, Reconstruction, (m,n) has a so- 
lution only if v; € {0,1,2}, for all j, since the non 4-adjacency constraint 
cannot be satisfied if v; = 3 for some j. There is no 1 in columns j such that 
vj = 0, and there must be a 1 in the first and third rows in column j such that 
vj — 2. In order to satisfy the non 4-adjacency constraint, there is no column 
j such that vj; = vj41 = 2, for 1 < j < n. Thus, the condition (C2) that 
follows must be satisfied: vj € {0,1,2}, vj + uj41 < 4, for j = 1,... n — 1, 
and v, € {0,1,2}. In the sequel we suppose that (C5) holds. 

We have now to study columns with vj — 1; they can be grouped in K 
blocks B* of maximal length Į} as shown above in the case m = 2. Note that 
if there is a column j such that v; = 1 and (vj41 = 2 or vj. = 2), the 1 of 
column j lies in the second row. 

In a first step, let Reconstruction, (m, n)[B*] be the subproblem as- 
sociated with a block B^. We denote by H} = (h*, hk, hk) its horizontal 
projections. Note that at this stage H^ is unknown. We are going to give 
some conditions on H* for a solution to Reconstructiony, (m, n)|B*] to 
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exist. We consider three cases: a block B* is adjacent to zero, to one, or to 
two columns with v; = 2. 


Proposition 6. There exists a solution to Reconstructiony, (m, n)[B*] if, 
and only if, 
(Co) ht + hk + Ak = I" 
and 
Case 1: B® is adjacent to two columns with v; = 0, then 
(CO Ae < Wee for i= 1,2,3; 
Case 2: B® is adjacent to a column with vj = 0 and to a column with 
Uj = 2, 
then 
(C5) hẹ < "|. i=1,3 and 1 < h§ < [1/2]; 
Case 3: B" is adjacent to two columns with vj = 2, then 
(Cg) IF #2. If i^ = 1, then (hf = hE = 0 and hk = 1) else 
ht < (5 —1)/2| fori 2 1,3, and 2 € h$ < [1*/2]. 


Proof. First let us prove the necessity of these conditions. 

hi hE + hE = MAU IF is the necessary condition (Co). 

The non 4-adjacency implies (condition (C1)) that hë < [1*/2], i = 1,2,3, 
must be satisfied for any case 1, 2, 3. Thus, for case 1, (C4) is a necessary 
condition. 

For case 2, without loss of generality, we assume that the column adjacent 
to the left of B! has a vertical projection equal to two and that the column 
adjacent to the right of B^ has a vertical projection equal to zero. As noted 
above, there must be a 0 in the first and third rows and thus a 1 in the second 
row of the first column of B^. Thus, the conditions of case 2 must be satisfied. 

Eventually, in case 3, as noted above, there must be a 0 on the first and 
third rows and a 1 in the second row in the first and last columns of B^. It 
follows that the non 4-adjacency cannot be satisfied if |^ = 2. If |^ = 1, the 
necessity of the condition (Cs) is trivial. If [^ > 3, then we must have h5 > 2 
and for rows 1 and 3, the condition (C1) is h} < |(I* — 2)/2] = |(i* — 1)/2]. 
'The conditions of case 3 must be satisfied. 

Now, assume that these conditions are satisfied. Let us denote by j' and 
j" the first and the last columns of B^, respectively. The rows are numbered 
such that hë > hk > hk. 

We propose the following procedure to build a solution. 


1. If vj... = 2, then assign a 1 to the cell (2, 7’) and do v; = 0 and update 
hk to hk — 1. If vj 41 = 2, then assign a 1 to the cell (2, j”) and do vj» = 0 
and update hk to hk — 1. 

2. While hE 4 0, do the following: Let j be the leftmost column of B* such 
that vj = 1 and (i, j) is not 4-adjacent to a cell that contains a 1; assign 
1 to (i,j) and do vj = 0 and update h} to h} — 1 (row i is filled from the 
left). 
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3. While h? Æ 0, do the following: Let j be the rightmost column of B® such 
that vj = 1 and (i, j) is not 4-adjacent to a cell that contains a 1; assign 
a 1 to (2, j) and do v; = 0 and h¥ — h¥ —1 (row 7 is filled from the right). 
4. Assign a 1 to the cells (^, 7) such that vj = 1. 


Let us prove the validity of our procedure: (Cs) and (Cg) ensure that we 
can perform step 1. The conditions (C4), (C5), and (Cs) ensure that we can 
perform step 2 whatever the row i is. Note that after this step there are no 
successive columns j and j + 1 with vj = vj41 = 0. Thus, step 3 can be 
performed. One notes that after step 3, the l's cannot be non 4-adjacent. 
Now, from condition (Co), step 4 can be executed. So it remains to show that 
the non 4-adjacency is always satisfied after this last step. 

Since in step 4 the 1’s are assigned to the cells (7, j) with v; = 1, we 
have to prove that at the beginning of this step there are no columns j and 
j +1 such that v; = vj;41 = 1. In order to obtain a contradiction, let us 
suppose that there are such two columns. Let jı, respectively jo, be the last 
column where a 1 is assigned in row i (step 2), respectively row ? (step 3). 
Following the manner in which steps 2 and 3 work, we have that j > jı + 1 
and j -- 1X ja — 1. Thus, we have h > |hF/2| + |hE/2] +2 > (RE + AP) /2. 
Since hk > hk > hE, we get a contradiction. 














Now, in order to solve the whole problem, for each block B^, we have to 
determine a projection H* satisfying, first the conditions of Proposition 6, 
and second XE A5 = ha and Eft h} — na = hi for i = 1,3, where nz is the 
number of columns of A with v; = 2. We propose to determine these H Kis 
by searching for a flow in a complete bipartite network G = (X, Y, E) defined 
as follows (see Fig. 7.6): X = (Ri, R2, R3], where the node R; is associated 
to the row i; Y = tu. ke ois K}, where the node b^ is associated to the 
block B*. We add a source node S and three arcs (S, R;). The capacities of 
these arcs are set to hg for i = 2, and to hi — n2 for i = 1,3. We add a sink 
node T and K arcs (b^, T). These arcs have capacity I^ for k = 1,..., K. 
Eventually, the capacities of the arcs (R;, b^) are as follows: If i = 2 or B* is 
a block as in case 1 of Proposition 6, then the capacity is equal to [1^ / 2]; if 
i = lor i = 3 and B* is as in case 2, then the capacity is equal to [15/2]; 
finally, if i = 1 or i = 3 and B" is as in case 3, then it is equal to |^ — 1)/2]. 
We also have a lower bound for the arcs (H3, b^): If B* is as in case 2, then 
the lower bound is 1; if B^ is as in case 3 and I^ Æ 1, then the lower bound is 
set to 2. The lower bound of every other arc is 0. 


Theorem 3. For m = 3, there exists a solution to Reconstructiony, (m, n) 
if, and only if, there exists a flow of total value XE ,l* in the network G. 


Proof. If there exists such a flow in G, let h® be equal to the flow on the arc 
(R;,b*). On the one hand, for each block B*, the flow of the arcs (b^, T) is I^, 
for k — 1,..., K, that means that AT + hS + h5 = I* (condition (Co)) and as 
remarked above the other conditions of Proposition 6 are satisfied. Now using 
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Ri b! Cast l 
hy — Ny > E A 
A t 
S, c v H2, 7 , ; p case 2 $t 
(Koy I^ 
hg —n » 
ANR 7T 





Fig. 7.6. The network associated with a matrix of N4 with m = 3. 


the algorithm designed in the proof of Proposition 6, we know how to assign 
the l's in each block B^. On the other hand, since the value of the flow is 
Be = hı + h2 + h3 — 2n5, the matrix A satisfies the horizontal projection 
and then A is a solution of Reconstructiony, (m, n). 

Conversely, if a solution of Reconstruction y, (m, n) exists, H* = (hf, hk, 
hE) is known for each block B^, k = 1,..., K. Thus, setting the value h^ for 
the flow of the arc (/t;, b^), we obtain a feasible flow with value Y/_,/* in the 
network G. 














Corollary 1. For m = 3, Reconstructiony, (m, n) can be solved in polyno- 
mial time. 


Proof. The computation of the flow in G can be done in polynomial time (see 
[1]). Clearly, the algorithm given in the proof of Proposition 6 is polynomial. 
Eventually, one can check the necessary conditions in polynomial time. 














Binary Matrices under Non 6-Adjacency Constraint 


We prove that Consistency y, (m, n) is NP-complete. To do that, we show 
that a special case of Consistency y, (m, n) is equivalent to an L-tile packing 
problem defined as follows: An L-tile is a polyomino with three cells as shown 
by Fig. 7.7. In addition, the lower left cell has a 1 and the two other cells 
have a 0. The projections are the number of 1’s lying in each line (a cell not 
covered by an L-tile has a 0). The problem is to find L-tile packing with given 
projections H = (hi,...,h;,) and V = (vi,..., Un). 

Obviously, an L-tile packing exists only if hi = v, = 0. Henceforth we 
are concerned with Consistency,;,(m,n : hy = Vn = 0), the subproblem 
restricted to the instances with hi = vn = 0. 
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L-tile packing Square packing 


Fig. 7.7. A matrix of Ne (at left) and a matrix of Ms (at right) with their tile 
packings. 


Lemma 2. Consistency y. (m,n : hy = Vn = 0) is equivalent to the L-tile 
packing problem. 


Proof. Consider an instance H = (0, h2,..., hm) and V = (vi,...,v4 3,0) 
common to both problems. Let Sı be a solution to Consistency py, (m,n : 
hı = vn = 0). A solution $5 to the L-tile packing problem can be ob- 
tained by assigning the 1’s of S4 to S2. Indeed, since in $1 two 1’s are non 
6-adjacent, two L-tiles cannot overlap in S2. Conversely, a solution $4 to 
Consistency y, (m, n : hy = Vn = 0) can be obtained from a solution S2 to 
the L-tile packing problem, as well. o 





The L-tile packing problem is known to be N P-complete [11]. So we deduce 
the following theorem: 


Theorem 4. Consistency, (m,n) is NP-complete. 


Binary Matrices under Non 8-Adjacency Constraint 


Here, we study Consistency y, (m, n). We first prove that a special case of 
Consistency, (m, n) is equivalent to the problem of 2 x 2 squares packing 
defined as follows: A 2 x 2 square is a polyomino with four cells as shown 
by Fig. 7.7. The lower left cell has a 1 and the other cells have a 0. The 
projections correspond to the number of 1’s in each line (a cell not covered 
has a 0). The problem is to find a 2 x 2 square packing with given projections 
H = (hy,...,hm) and V = (vi,..., Un). Obviously, a 2 x 2 square packing is 
possible only if hy = vn = 0. 

Now, consider Consistency,,,(m,n : hy = vn = 0), the subproblem 
restricted to instances with hy = v, = 0. 


Lemma 3. Consistency, (m,n : hy = Vn = 0) is equivalent to the 2 x 2 
square packing problem. 
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Proof. The proof uses the same arguments as proof of Lemma 2. 


The 2 x 2 square packing problem has been proved to be as difficult as the 
three-colored image reconstruction problem, which is still an open problem 
[13]. We deduce the following theorem: 


Theorem 5. Consistency, (m, n) is as difficult as the three-colored image 
reconstruction problem. 


7.3.3 Conclusion 


In this section we have studied problems Reconstruction, (m, n) consist- 
ing of reconstructing a binary matrix that fulfills the non p-adjacency con- 
straint from orthogonal projections. Polynomial-time algorithms are given to 
solve Reconstructiony,(m,n) and Reconstructiony,(m,n) in the spe- 
cial case where m < 3. We also give a polynomial-time algorithm for 
Reconstructiony,(m), the problem consisting of reconstructing a binary 
matrix that fulfills the non 4-adjacency constraint from only one projec- 
tion. For Reconstructiony,(m,n), we have proved the N P-completeness 
of Consistency y, (m, n), its associated decision problem. The complexity 
status of both Reconstructiony, (m, n) and Reconstruction m, (m, n) re- 
mains an open question. However, we have shown that this last problem is 
harder than the three-colored image reconstruction problem, which is well 
known to be a major open problem in discrete tomography. 


7.4 A Timetabling Constraint 


We consider the problem of reconstructing an m xn binary matrix A € Tp from 
its horizontal H = (hi,...,h,,) and vertical V = (vi,..., Un) projections, 
where 7; is the class of binary matrices satisfying the timetabling constraint: 
If aij = 1, then a;j;-1 = 1 or a; j+1 = 1, and the number of successive 0’s in 
each row of A is less than or equal to a given integer k > 2. We denote this 
problem by Reconstructionz, (m, n). 

This section is organized as follows. Reconstruction;, (m,n) is intro- 
duced, and we give necessary conditions for a solution to exist. Then we state 
some basic properties for Reconstruction; (m, n : m = 2), the subproblem 
with m — 2. Finally, we use them to give a polynomial-time algorithm for 
Reconstruction7, (m, n : m = 2), the problem with m = 2 and k = 2. 


7.4.1 The Basic Model 


Formally, Reconstruction; (m, n) is defined as follows: We are given two 
vectors H = (hi,..., hm) and V = (vi,..., v) of nonnegative integers and 
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an integer k > 2. From H and V we want to reconstruct an m x n binary 
matrix A such that, for 1 C4 x mand 1€ j € n, 


Mas = hi, (7.1) 

Mia lij = Vj, (7.2) 

if Qi j = 1, then Qij X Qij—1 + aij X aiji > 0 (aio = li n+1 = 0), (7.3) 
Doai > 0, with <j € n—k41. (7.4) 

It is easy to see that (7.3) and (7.4) ensure that A € 7; and (7.1) and (7.2) 


ensure that both projections are satisfied. 
The following property gives necessary conditions for a solution to exist: 


Property 6. If Reconstruction;, (m,n) has a solution, then, for 1 <i < m 
andl<j<n, 
l 2|n/(k 4- 2)], if (n)mody4 3 < k, 
PT SUARUM Ds +2)|+1, otherwise, 
(b) A hi = yet Vj; 


(c) vj € vj-1 +vj+1,1 < j < n (assuming vo = vs44 = 0). 





Proof. The inequalities vj < m and n > h; of (a) are trivial. From the defini- 
tion of Reconstruction; (m, n), (7.3) and (7.4) imply that 


e if (n)modk+2 < k, then the sum of k + 2 consecutive elements of a generic 
row of A has to be greater than 1, and thus h; > 2|n/(k + 2)], 

e if(n)mody,» = k-4- 1, then the last k+ 1 elements of each row of A contain 
one entry 1 at least, so we have h; > 2|n/(k + 2)] +1. 














(b) and (c) follow from the definition of Reconstruction;, (m, n). 





7.4.2 Necessary Conditions for Reconstruction; (m, n : m = 2) 


From now on, we concentrate our study on Reconstruction7, (m,n : m = 
2), giving further necessary conditions for a solution to exist. 

'The choice m — 2 allows the values of each vertical projection to belong 
to the set (0, 1,2). Furthermore, if a vertical projection, say v;, has value 0 
or 2, then the corresponding values of the jth column of A are fixed, i.e., 
a1 j = a2,; = 0 or aij = a2,; = 1, respectively. Thus, the nontrivial case is 
when v; = 1, which implies a1 j + a2,; = 1. 

Let us decompose the vector V = (v1,...,Un) of vertical projections into 
the subsequences Bj, Bj, BY, B$, By, B3,...,B}, B}, B2, called blocks, such 
that the block B? is a maximal subsequence of successive 0’s of length /?, B1 
is a maximal subsequence of successive 1’s of length l}, and B? is a maximal 
subsequence of successive 2's of length 1?, with 1 < i < p. We also allow a block 
Bi to be empty, i.e., H = 0. Obviously, the sum of the lengths of the blocks 
of consecutive 2’s constitutes a lower bound to each horizontal projection. 
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For instance, if V = (1,1,1,0,0,2,0,0,1,2,1), then its decomposition into 
blocks is 
1,1,1, 0,0, 2, 0,0, 1, 2, 1 
we eae S Sw 
HB DI D? BQ DI D? DI By D DI BY BR, 


and the lengths of the blocks are 








P=Pa=ha=R=2=0, Pah=Rah=1, B=L=2, andi =3. 








It is straightforward that the length of every subsequence of any block de- 
pends on the length of the subsequences nearby. The following lemma provides 
necessary conditions for the existence of a solution. 


Lemma 4. If Reconstructionz,(m,n : m = 2) has a solution, and the 
vector V is decomposed into p blocks, then, for 1 < i < p, 


a)l? <k 

(a) l} € k, 

(b) ifi. =0, then 9 € k — 1, 

(c) #2 > 0, then I « k — 1, 

(d) if I2. , — 0 and (Il , 2 2 orl} > 2), then I} < k — 2. 


Proof. If 12 = 0, then the four statements are immediately satisfied. So, let 
B? be a nonempty block of consecutive 0’s, and let q and r be the indices 
of its leftmost and rightmost positions in V, respectively. Now, (a) and (b) 
are immediate from (7.4). To show (c), we note that if I} > 0, then we have 
G1,r414 02,741 = 1. Let us suppose, without loss of generality, that a1,r+1 = 0. 
If 12 > k—1, then m a1j = 0, which is inconsistent with (7.4). To show (d), 
we note that if /} > 2, then we have a1 r+1 +a2,r+1 = 1 and a1,-42+42,r42 = 1; 
without loss of generality, we suppose that a1,-41 = 0. Since a2,- = 0, (7.3) 
implies that a2,42 = 1, and, consequently, a1,,-42 = 0, which generates an 
inconsistency with (7.4). The case [1 , > 2 is analogous. 

















7.4.3 A Polynomial Algorithm for Reconstructionz, (m,n: m = 2) 


Let us consider an instance J = (H,V) of Reconstructionz,(m,n : m = 
2). In the sequel, we furnish some operative lemmas that change I into an 
equivalent instance I = (H,V) with ñ < n, by reducing the lengths of some 
blocks of the decomposition of V. The equivalence of the two instances will 
directly follow from next lemmas. 


Lemma 5. Let I be the instance I modified as follows: For each block B2 in 
the decomposition of V such that l2 = q > 2, let us set in I 


2—2, ñ=n-q+2, hj —-hi—-q-^«2, and h;—ha3—4-42, (7.5) 


and, let us modify the vector V in accordance with the new value of p. Then 
the instances I and I are equivalent. 
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Proof. Let A be a binary matrix that is a solution of J. For each column j 
of A such that vj belongs to the block B?, it holds a1,; = a2,; = 1. Deleting 
q — 2 such columns, we obtain a binary matrix A, as can be easily checked to 
be a solution of I. 

Conversely, let A be a solution of I , and let the projection v; belong to 
B2, If we add q — 2 columns after column j, and we set all their entries to the 
value 1, then we obtain the desired solution for T. 














Lemma 6. Let I be an instance of Reconstructionz,(m, n : m — 2). There 
exists am instance I equivalent to I, such that the vector V has no elements 
equal to 0. 


Proof. Let the vector V of the instance I be divided into 3p blocks, and recall 
that from Lemma 4, we have I? < 2, with 1 < i < p. 

If 19 = 2 (ie, the block B? is composed of the vertical projections 
(vj, vj41) = (0,0)), then from Lemma 4(b) and (c), we obtain that 12 , > 0 
and |] = 0, and from Property 6(c), we have |2 > 2 and I? > 2, i.e., 
(v;—2, see Uj43) = (2, 2, 0, 0, 2, 2). 

Let A be the matrix obtained from A by deleting columns 7 and j +1. It 
is straightforward that A is a solution of I if, and only if, A is a solution of 
the instance I such that 


H=H and V = (v1,..., vj jane Un): (7.6) 


We note that if 1? = 2 or 1} = 2 (i.e., (0,0,2) is the initial subsequence of V 
or (2,0,0) is the final subsequence of V), then the transformation still holds. 
Thus, we have that I = I. 

If 1? = 1 (i.e., the block B? contains the single vertical projection (vj) = 
(0)), then we analyze the following exhaustive series of subsequences of V. 


(a) (vj-2,..., vj42) = (2, 2,0, 2, 2): If A is a solution of J, then acting similarly 
to the previous case, i.e., when /9 = 2, we can construct a matrix A that 
is a solution of the instance I that is equivalent to I. 

(b) (vjza,...,vj43) = (2,2, 1,0,1,2,2): If A is a solutions of J, then there 
are only two possible submatrices corresponding to the seven columns 


j—353-3: 
1110011 1100111 
oon) ee noni): Hu 


Starting from A, we can construct the matrix A by deleting its three 


columns j — 1, j, and j +1, so that A is a solution of I if, and only if, A 
is a solution of the instance J such that 





H = (hi = 1, h2 = 1) and Y = (v1, 00, Uj—2, 042; Ur): (7.8) 
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(c) (uj—3,---,¥j+3) = (2,2,1,0,1,2,1): If A is a solution of J, then its only 
two possible submatrices corresponding to the seven columns j—3,...7+3 


are 
1110011 1100110 (7.9) 
iT00110J °° 1110011/)° i 

Starting from A, we can construct the matrix A by deleting its four 


columns j, j +1, j + 2, and j + 3, so that A is a solution of the instance 
I such that 


H = (hy — 2, he — 2) and V = (vi,..., vj-1, Uj44y +++ n)- (7.10) 


The equivalence of the instances J and I can be easily deduced. 

(d) (uj-3,.--,U;+3) = (1,2, 1,0, 1, 2,2): symmetrical to the previous case. 

(e) (uj—3,---,U;+3) = (1,2, 1,0,1, 2,1): similar to the previous two cases. 

(£) (vj-1,---,Uj+2) = (2,0,1,2): From Property 6, we have that vj_2 = 2. 

The matrix A obtained after deleting column j from A is a solution of the 

instance [ , where H=H , and V is obtained from V after deleting the 

entry vj. Obviously, J is equivalent to Í. 

) (vj-1;---, 0542) = (2, 1,0,2): symmetrical to the previous case. 

h) (v1) = (0) (respectively, (vn) = (0)): If A is a solution of J, then the 
matrix Å obtained from A after deleting its first (respectively, its last) 
column is obviously a solution of instance J such that H = H, and V is 
obtained from V after deleting the entry v1 (respectively, vn). 





The given list of configurations of the entries of V surrounding column j 
when B? = (v;) is exhaustive, since Lemma 4 assures that the subsequences 
(1,1,0) and (0,1, 1) cannot be subsequences of V. 














Lemma 7. Let I be an instance of Reconstructionz,(m, n : m = 2). There 
exists an instance I, equivalent to I, such that the vector V contains no sub- 
sequences of consecutive entries 1 having length greater than 4. 


Proof. Let the vector V of the instance I be divided into 3p blocks, and let us 
suppose that there exists a block B} = (vj,...,vj44) = (1,...,1) such that 
ll =q+1 > 5, with 1 € i € p. If A is a solution of J, then the submatrix 
composed of its four columns j, j +1, j + 2, and j + 3 is one among 


1100 0011 1001 0110 
Lon Ge (0110) and era alt) 
Moreover, for each of these four configurations, it is easy to check that the 
columns j and j + 4 are identical, and so the matrix A obtained from A by 


deleting its four columns j, j +1, j +2, and j +3 is a solution of the instance 
I, where 


H = (hi = 2, ho ar 2) and V = (v1, e, Uj—-1,Uj+4,--- Un). (7.12) 


The equivalence of I and I is straightforward. Repeating the deletion of all 
the groups of four consecutive entries 1 from V, we obtain the lemma. 
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7.4.4 Graph Building 


The three lemmas stated above allow us to solve Reconstructionz, (m,n : 
m = 2) by considering only its instances that satisfy the three constraints 
l1? = 0, I} < 4, and I? € 2, with 1 < i € p. Now, each one of such instances, 
say instance 7 = (H, V) to keep the notation already introduced, is going to 
be reduced to an instance of a fast solvable problem concerning the search for 
paths inside a graph. 

Let V be divided into 2p blocks Bi, B?,..., B}, B7. From the sequence of 
blocks, we construct a directed labelled graph G — (X, A, lab). 

'The nodes of G, i.e., the elements of the set X , are associated to the blocks 
B? in the following way, for 1 < i < p: 


(a) if 1? = 2, then we associate one node x; to the block B2; 
(b) if 1? = 1, then we associate two different nodes x} and x? to the block B2. 


We also add to X a source node zo and a sink node zp+1. 

The arcs of G, that is, the elements of A, connect the nodes associated to 
the blocks B? ,, and B? and correspond to the blocks B}. Since Lemma 6 
allows us to get rid of the blocks composed of 0’s, for each 1 < i € p, we list 
all the possible configurations for the submatrix A} associated with the block 
B} (this permits us to reduce the number of arcs) as follows: 

If Il = 1, then the submatrix A} can be one of the following: 


m = o or [lg = (1) j (7.13) 


if 11 = 2, then A} can be one of the following: 


; fil , (00 x 418 ; (Y. 
= (oo) = (5). B= (à) or “i= (5): (7.14) 


if 11 = 3, then A} can be one of the following: 


3 f110\ o 00b. our f 100 3_ Od 
Hi = 0 > H2 = 110 /" H3 = 011 Or Ja = 100 ’ (7.15) 


finally, if 1} = 4, then A} can be one of the following: 


4 (1100\ 4 (0011) 4 (1001 , (0110 
m Gar E m am (ino) PROSPECT Gar Eee) 


Note that every possible configuration for A} satisfies (7.4) of the ba- 
sic model, but not (7.3). So this last property is not local, but global. In 
other words, the “compatibility” of a specific configuration with the block B} 
strongly depends on the configurations associated to the blocks B?_, and B?. 
As an “extreme” example, the submatrix u3 is not compatible with the block 
Bj (assuming Ij = 3), while it is compatible with the block BJ when l} = 3 
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(assuming [3 — 2). Indeed, in order for this last property to be satisfied by 
the global solution, we need to express this constraint in terms of the arcs of 
the graph we are going to build. 

We can now assign the arcs of the graph connecting the nodes associated 
to the block B2 , with those associated to the B2, as follows: 


(a) I£ 12 , = 12 = 2, then there is an arc (sj 1, 2;); 

if 12 , 22 and l? = 1, then there are two arcs (zr; 1,21) and (zi 1,22); 

if |I? , — 1 and l? = 2, then there are two arcs (z]1 ,,;) and (2 ,,2i); 

if 1? į; Sand l? = 1, then 

a) if 11 = 1, then there are two arcs (x}_4, x2) and (x21, x}); 

b) if 11 = 3, then there are two arcs (x}_;, £1) and (£? ,,22); 

c) if I} = 2 or I] = 4, then there are all four possible arcs (x1 4,1), 
(214,27), (22 4,21), and (z? 4,27). 


a 
b 
c 


em 


) 
) 
) 
d) 


— 


Let us refer to y; a generic node. The function lab associates to each arc 
(yi-1,Ui) a set of integers (labels) in the interval [0,4], depending on the 
possible submatrices that are compatible with the blocks B2 , and B? in the 
final solution A. 

More precisely, each label counts the number of 1’s lying in the first row 
of each possible configuration u? of the block B] which is compatible with 
the blocks B? , and B? and is such that I} = j. We remark that the word 
"compatible" is once again used with the meaning of satisfying the timetabling 
constraint. 

A further assumption is needed: If an arc reaches the node x} (respectively, 
z2), then the configurations chosen for the assignment of its labels has to end 
with the column (1,0)7 (respectively, (0, 1)7). 

An example of such a graph is given by Fig. 7.8. The graph has seven 
nodes in addition to zo and z5, since (1121112212111121) is composed of four 
blocks B2, one of which, B2 of length 2, gives rise to node x2, and the others 
of lengths 1. Since Bl has length 2 configurations, u? and ji2 are the only 
ones that satisfy property (7.3) of the basic model. Therefore, we assign label 
{2} to (xo, 1) corresponding to configuration u? and label {0} to (xo, 27) 
corresponding to configuration u2. B4 has length 3, and we assign label (1,2) 
to (z1, 22), since the compatible configurations are u3 and u3, and label (1,2) 
to (a7, 22), since the compatible configurations are u3 and u3. Then, Bł has 
length 1, and since B2 has length 2, all the configurations are compatible, and 
so we label (23,21) with {1} (corresponding to pt), and (x2, x2) with 0 (cor- 
responding to u4). Bt has length 4, and the label {2} to (x4, z1) corresponds 
to configuration u43, while the label to (z1,22) corresponds to configuration 
u4. We do similarly for the arcs from z2. Finally, (x}, 5) has label {1}, which 
corresponds to configuration pt, and (x2, £5) has label {0}, which corresponds 
to configuration jid. 

To the path m = (£0, x?, £2, £2, x}, £5) of Fig. 7.8, correspond the two 
matrices 
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2} 02 {l m 
{2} 
a {1,2} {0} 
Fig. 7.8. The graph built from (1121112212111121). 


(7.17) 


M= 0011101101100110 d M= 0011001101100110 
~ (1110011111011011 ~ \ 1110111111011011 


The graph G has the following property: 


Lemma 8. There is a one-to-one correspondence between the paths of G and 
the matrices A satisfying (7.2), (7.3), and (7.4) of the basic model. 


Proof. Let 7 = (zo,y1, ..., Yp, tp41) be a path of G. The matrices A corre- 
sponding to 7 are such that M = Aj.Aj.Aj...A}.A?, where the submatrices 
A? are associated to the nodes y; of m and the submatrices A} are associated 
to the arcs (yi—1, yi). 

Consider any node y;, 1 < i € p: if y; = xi; then 


A? = Gal (7.18) 


while if y; = x} or y; = z2, then A? = (1,1)T. 

For an arc (z;:1,2;), 2 < i X p, associated to the block Bl of length 
ll = j, Al is a certain matrix ji. 

In particular, for the first arc (xo, y) of 7, we have the following matrices 
At: If j = 0, then A} does not exist; if y = zi, then we have A} = ut or 
Ay = pa for j= 1, Al = pi or Aj = p3 for j = 2, Al = uj or Al = p3 for 
j = 3, Apad aA cud for j = 4; if y = vi, then we have Aj = pi for 
j= 1, Al = u? for j = 2, Al = u3 for j = 3, Al = p4 for j = 4; and in the 
case y = xî, we have At = wd for j = 1, At = ua for j = 2, At = uj for j = 3, 
Al = pî for j = 4. 

For the last arc (y, £p+1), Y F Tp; of 7, we Im the following uc A: 
If y = £p—1, then we have A] = ul or Ai =) for j = 1, Ai = HH or Aj Eus 
for j — 2, Aj = id or Aj = jd for j ='3, Al = ui or Al todas =a if 
y = z} 1, then we have A1 = uł for j = 1, ‘Al = for 7 = 2 Ap = p4 for 
j= 3, Ap = ui for j — 4; ‘in the case where y = 22. 1, we have Aj = pj for 
j=1, Aj = Hi for j = 2, A} = p3 for j = 3, A} = pý for j =4. 
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For an arc (y, z1), 2 < i € p, A} is a matrix ji) such that its last column 
is (1,0)7, that is, A! = yt for the case j = 1, A} = u? or A} = u3 in case of 
j = 2, A} = u3 or A} = u3 for the case j = 3, Al = u$ or A} = p3 for the 
case j — 4. 

For an arc (y, z2), 2 € i € p, Al is a matrix uj] such that the last column 
of pi is (0, 1)7, i.e., A} = u4 for j = 1, A} = ud or Al = p for j = 2, AL = ui 
or Al = p3 for j = 3, Al = pî or Al = på for j = 4. 

In a same way, for an arc (x}_,,y), 2 € i < p, Al is a matrix ji] having 
(0, 1)7 for last column, i.e., id for j = 1, u2 or u2 for j = 2, uj or u3 for 
j = 3, uł or u4 for j = 4. 

Finally, for an arc (x?_,,y), 2 € i € p, Al is a matrix with last column 
(1,0)., i.e., ut for j = 1, u? or u3 for j = 2, u3 or u3 for j = 3, ut or u for 
Ji 

The matrices ul satisfy (7.2) and (7.4) of the basic model, and so the 
matrices A satisfy (7.2) and (7.4). 

Trivially, the point (7.3) of the basic model is satisfied for the submatrix 
associated with a node z;,1 < i € p, and for a matrix u? associated with an 
arc (z; — 1, xi) of v. Now let us consider a node z}: Following the choices made 
for the matrices jz) associated with the arc (y, x!) entering in x} and for the 
arc (z], y) leaving out x}, (7.3) is satisfied. The same fact holds for the nodes 
x? of T. 

Now let A be a matrix satisfying (7.2), (7.3), and (7.4) of the basic model. 
The nodes of u = (o0,---,%p41) are as follows: Let B? be a block such that 
l? = 1. We denote by A? the column of A corresponding to B?, and we denote 
by A’ the submatrix of A consisting of the three columns 4-1, AJ, AJt?. 
From (7.3), A? is either 


lj 110 lj 011 
LUKE jus 
A in) or A Cans (7.19) 


In the first case, u passes through x}, and in the second case, the path u goes 
through the node z?. If B? is a block such that l? = 2, then x? is a node of 
the path u. Following the manner in which the arcs are set in G, we have that 
u = (£9,...,2p41) is a path of G. 














Let u be a path from zo to z541. Denoting by A;,,0 < i < p, the set of 
integers making the label of the arc (yi, yi+1) of u, the integer set H (u) is 
defined as H(u) = H(xp41) with 


0, if i — 0, 

H(y) = DN Bae Hcet Se. (20) 

Using hi = |(j | vj = 1}| (the number of columns with vertical projection 

equal to one), the next lemma gives a necessary and sufficient condition for a 
solution of Reconstructionz, (m,n : m = 2, k = 2) to exist. 
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Lemma 9. A solution of Reconstructionz,(m,n : m = 2) exists if, and 
only if, there is a path u = (xo,..., tp41) of G such that hı € H(p). 


Proof. From the previous lemma, it remains to prove that a matrix satisfying 
(7.1) of the basic model exists if, and only if, there exists a path u of G such 
that hı € H(y). 

In G, the values making the label of an arc are the sums of the first row of 
the matrices u? associated with this arc (see the proof of the previous lemma). 
Thus, if A is a matrix with the first row sum equal to hy, its corresponding 
path in G is such that hı € H (u). Now, if there exists u such that hı € H (y), 
then there exists a matrix A with h4 for the sum of its first row. 














Hence, we are able to establish the following result. 
Theorem 6. Reconstructionz, (m, n : m — 2) can be solved in time O(n?). 


Proof. G being a directed acyclic graph, we can use dynamic programming, 
i.e., the Ford algorithm (see [14], Section 24.2) to compute the values H (x). 
Since a path from zo to 54.4 has length p and the label of an arc of G is such 
that A C {0,1,2}, we have that H(z;) € (0,...,2p], 0 € i € p4- 1. Each node 
of G has at most two ancestors, so H(x;) can be computed with time O(p). 
So we can compute the values H (x) in time O(p?). 














7.4.5 Conclusion 


In this section we have studied Reconstruction; (m,n), the problem of 
constructing a binary matrix that fulfills a particular timetabling constraint 
from orthogonal projections. We have designed a polynomial-time algorithm 
for the special subproblem Reconstructionz, (m, n : m = 2), where m = 2 
and k — 2. Using similar but more combinatorial and sophisticated arguments, 
we can design a polynomial-time algorithm for Reconstruction; (m,n : 
m — 2). The complexity status of Reconstruction; (m,n) is still open 
even for its subproblem in which m = 3 and k = 2. 
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Part II 


Discrete Tomography Reconstruction 
Algorithms 


8 


Decomposition Algorithms for Reconstructing 
Discrete Sets with Disjoint Components 


P. Balazs 


Summary. The reconstruction of discrete sets from their projections is a frequently 
studied field in discrete tomography with applications in electron microscopy, image 
processing, radiology, and so on. Several efficient reconstruction algorithms have 
been developed for certain classes of discrete sets having some good geometrical 
properties. On the other hand, it has been shown that the reconstruction under 
certain circumstances can be very time-consuming, even NP-hard. In this chapter 
we show how prior information that the set to be reconstructed consists of several 
components can be exploited in order to facilitate the reconstruction. We present 
some general techniques to decompose a discrete set into components knowing only 
its projections and thus reduce the reconstruction of a general discrete set to the 
reconstruction of single components, which is usually a simpler task. 


8.1 Introduction 


The reconstruction of two-dimensional discrete sets from few (usually up to 
four) projections is a frequently studied field of discrete tomography. Recon- 
struction algorithms have a wide area of applications, e.g., in electron mi- 
croscopy, image processing, nondestructive testing, radiology, statistical data 
security, and so on (see the final chapters of this book and [14] for some of 
the applications). 

The main challenge in designing reconstruction algorithms is that the re- 
construction problem is usually underdetermined; therefore, the number of 
solutions for a given reconstruction task can be very large. Moreover, the re- 
construction under certain circumstances can be NP-hard [10, 12, 18]. Since 
applications require fast algorithms that give unambiguous solutions, scien- 
tists of this field began to study the possibility of keeping the reconstruction 
process tractable and to reduce the number of solutions. A commonly used 
technique that can lead to efficient reconstruction is to suppose some prior 
information on the set to be reconstructed. The most frequently used proper- 
ties are connectedness, directedness, and some kind of discrete versions of the 
convexity. A lot of work has been done in designing efficient reconstruction 
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algorithms for different classes of discrete sets satisfying some of the above 
properties [4, 5, 6, 7, 8, 11, 15, 16, 17]. While the reconstruction from two 
projections in the class of hv-convex sets is in general NP-hard [18], it turned 
out that the additional prior knowledge that the set has only one component 
(i.e., it is 4-connected) leads to a polynomial-time reconstruction algorithm 
[6]. Later, this algorithm was generalized for the class of 8-connected hv- 
convex sets [4]. Surprisingly, in [3] the authors showed that in this class the 
prior information that the set to be reconstructed consists of more than one 
component (i.e., if the set is not 4-connected) leads to a more efficient recon- 
struction algorithm than the general one developed in [4]. The above results 
show that prior knowledge about the number of components can have an effect 
on the reconstruction task. 

The aim of this chapter is to present what we know about reconstruc- 
tion complexity in several classes when the additional information that the 
set consists of several disjoint components is also given. The chapter is struc- 
tured as follows: After introducing the necessary definitions and notations 
(Section 8.2), we investigate the problem of reconstructing discrete sets with 
disjoint components from two projections in Section 8.3. In Section 8.4, we de- 
scribe a general technique for reconstructing decomposable discrete sets from 
four projections. Section 8.5 is about the correctness and complexity of the 
algorithms presented in the previous two sections. Finally, in Section 8.6, we 
conclude our results. 


8.2 Preliminaries 


A discrete set F is a finite subset of Z? and can be represented as a set of 
pixels or by a binary matrix F (see Fig. 8.1). The elements of a discrete set are 
called points or positions. The smallest containing discrete rectangle (SCDR) 
of F is denoted by R(F). For any discrete set F, we define the vectors H(F) = 
H = (hi,..., hm), V(F) =V= (01, ..., Un), D(F) = )-= (di, ... di. i); 
and .A(F) = A= (a1,..-,@m+n—1), where 


hi =X fiy, t= 1. m (8.1) 
j=l 
v= X fi, J=l,...,n, (8.2) 
g= 
dk = 5 fa âk = 5 fa. k—1,...,m-- n— 1. (8.3) 
it(n—j)=k i+j=k+1 


The vectors H, V, D, and A are called the horizontal, vertical, diagonal, 
and antidiagonal_projections of F', respectively. The cumulated vectors of F 
are denoted by H = (hi,...,hm), V = (%1,.--,Un), D = (di,...,dm+4n-1), 
and A = (@,...,@m-4n—1) and defined by the following formulas: 
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Fig. 8.1. A discrete set F represented by gray unitary squares and the corresponding 
binary matrix F. The projections of F are H, V, D = (0,0,0,1, 2,3, 2,2,1,0,0), 
and A — (0,0,2,2,1,1,2,0,1,2,0). The cumulated vectors of F are H, V, D= 
(0,0, 0, 1,3,6,8, 10, 11, 11, 11), and A= (0, 0, 2, 4, 5, 6, 8, 8, 9, 11, 11). 


Ah -— d  €e1ogm (8.4) 
i—1 
j 
Vj— v, 3 eee E (8.5) 
[zu 
k k 
d, = Vd, a= y dn k —1,...,m -n— 1. (8.6) 
l=1 fal 


Given an arbitrary class G of discrete sets, we say that the discrete set 
F € G is unique in the class G (with respect to some projections) if there is 
no other discrete set F’ € G with the same projections. 

In this chapter, we study the reconstruction problem from two and four 
projections in several classes, whose tasks for a given class G C F can be 
formulated as follows: 
2-RECONSTRUCTION(G) 


Instance: | Two nonnegative vectors H € Nọ and V € Nj. 
Task: Construct a discrete set F € G such that H(F) = H and 
V(F)=V. 
4-RECONSTRUCTION(G) 
Instance: Four nonnegative vectors H € Nj’, V € Nọ, D € BRA 
and A € Ny t. 
Task: Construct a discrete set F € G such that H(F) = H, V(F) = 


V, D(F) = D, and A(F) = A. 

In order to keep the reconstruction process tractable and to reduce the 
number of solutions of a given reconstruction problem, it is often supposed 
that the set to be reconstructed has some special geometrical features. In the 
following, we briefly describe the ones used most frequently. 
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Connectedness and Convexity 


Two points P = (pi,p2) and Q = (q1,q2) in a discrete set F are said to be 
4-adjacent if |p3 — qi| + |pa — q2| = 1. The points P and Q are said to be 
8-adjacent if they are 4-adjacent or (|[p1 — qı| = 1 and |p2 — q2| = 1). The 
sequence of distinct points Po,..., Pk is a 4/8-path from point Po to point Py 
in a discrete set F if each point of the sequence is in F and Pj is 4/8-adjacent, 
respectively, to Pj. for each | = 1,..., Kk. A discrete set F is 4/8-connected 
if for any two points of F there is a 4/8-path, respectively, in F between 
them. The 4-connected set is also called as polyomino [13]. Clearly, every 
4-connected set is also 8-connected, but the converse is not true [see, e.g., 
Fig. 8.2(b)]. The discrete set F is horizontally convex/vertically convex (or 
shortly, h-convex/v-convex) if its rows/columns are 4-connected, respectively. 
The h- and v-convex sets are called hv-convex [see Figs. 8.2(b) and (d)]. 

For any point P = (pi, p2), we define the four quadrants around P by 


Ro(P) = {Q = (um, @) | a < pı and q2 € po}, 
Ri(P) = {Q = (u,@) | a = pı and q@ < po}, 
Ro(P) = {Q = (u,@) | a = pı and q2 2 pə}, 
R3(P) = {Q = (m1, 42) | n < pi and q2 > po} . (8.7) 


A discrete set F is Q-convex if R(P) N F Æ ( for all k € {0,1,2,3} implies 
P € F (see, e.g., Fig. 8.2(c)]. 


Remark 1. More precisely, the above definition corresponds to Q-convexity 
along the horizontal and vertical directions as Q-convexity can be defined in 
a more general manner [7]. However, for the sake of simplicity, we will use this 
abbreviated form. 
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Fig. 8.2. (a) A NE-directed polyomino. (b) An hv-convex 8- but not 4-connected 
discrete set. (c) A Q-convex but not 8-connected discrete set. (d) An hv-convex 
decomposable but not Q-convex discrete set. 


Directedness 


A 4-path in a discrete set F is a northeast path (or shortly, NE-path) from 
point P$ to point P, if each point P, of the path is in north or east of Pj. 
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for each | = 1,...,t. SW-, SE-, NW-paths are defined similarly. The discrete 
set F is NE-directed if there is a particular point of F, called the source, such 
that there is an NE-path in F from the source to any other point of F. It 
follows from the definition that the source of a NE-directed set is necessarily 
the point (m,1) [see Fig. 8.2(a)]. Similar definitions can be given for SW-, 
SE-, and NW-directedness. We simply say that the discrete set is directed if 
it is NE-, SW-, SE-, or NW-directed. 


Decomposability 


A maximal 4-connected subset of a discrete set F is called a component of 
F. We say that that two components Fı and P5 of F are disjoint if both 
the sets of the row indices and the sets of the column indices of Fy and F> 
are disjoint. For example, the discrete set in Fig. 8.1 has three components: 
Fi = {(1,3), (2,2), (2, 3): F = {(4, 1), (4,2), (4, 3), (4, 4), (5, 3): and P3 — 
{(5,5), (5,6), (6,5), where F1 and F3 are disjoint. 

Given two discrete sets C and D represented by the binary matrices C= 
(ĉij)mixnı and b- [dis ons: respectively, we say that we get the discrete 
set F represented by the binary matrix F = Chia by a northwest-gluing 
(or shortly, NW-gluing) from C and D if 


Ê= i 5) such that ma > mı + Mo and ng > nı + ne. (8.8) 


We stress the importance of the fact that in the resulting set F there 
can be empty rows or/and columns between the subsets C and D (namely, if 
ma > Mı + mz or/and na > nı +72). If C is a single component, then we say 
that C is the NW-component of F. NE-, SE-, SW-gluings and -components 
are defined similarly. We say that a discrete set F consisting of k (k > 2) 
components is decomposable if 


(a)the components are uniquely reconstructible from their horizontal and 
vertical projections in polynomial time, and 

(B) the components are pairwise disjoint, and 

(y)if k > 2, then we get F by gluing a single component to a decomposable 
discrete set consisting of k — 1 components using one of the four gluing 
operators. 


As a straightforward consequence of this definition, we obtain that every 
discrete set consisting of one component is nondecomposable and every dis- 
crete set consisting of two or three components and satisfying properties (a) 
and (8) is decomposable. Figure 8.3 shows some decomposable and nonde- 
composable configurations if the set satisfies property (9) and consists of four 
components. 

If omitting empty rows and columns the SCDRs of the components of a 
decomposable discrete set F are connected to each other with their bottom 


158 P. Balazs 
















































































































































































F, F, F, F, 
F, F, F, F, 
F, F, F, F, 
F, F, F, F, 
(a) (b) (c) (d) 
F, F, F, F, 
F, F, F, F, 
F, F, F, F, 
Ifa p E F, 





















































(e) (f) (g) (h) 


Fig. 8.3. Some decomposable (a)-(f) and nondecomposable (g)-(h) configurations 
of the components F3, F5, F5, and F4. The SCDRs of the components are marked 
with gray squares. The possible empty rows and columns are not indicated. 


right and upper left (bottom left and upper right) corners, then we say that 
F is of type 1/2, respectively [see Figs. 8.3(e) and (f) in the case of four 
components]. 

Throughout this chapter we will use the following notations for classes of 
discrete sets. 


(a) Så for the class of 8- but not 4-connected hv-convex sets. 

(b) Q for the class of non-4-connected Q-convex discrete sets. 
(c) DEC for the class of decomposable discrete sets. 

(d) S*, S** for the class of discrete sets of type 1/2, respectively. 


8.3 Reconstruction from Two Projections 


'The reconstruction from horizontal and vertical projections in classes of dis- 
crete sets having some convexity and connectedness properties has been fre- 
quently studied by mathematicians and computer scientists. In [10] a survey 
is given for the reconstruction complexity in classes of discrete sets having 
properties h-convexity, v-convexity, or 4-connectedness. Discrete sets satisfy- 
ing none of the above properties can be reconstructed in O(mn) time by an 
early result from Ryser [17]. In [10] it is shown that if a set satisfies only one 
or two of the above properties, then its reconstruction is NP-complete. In the 
same time, if all three properties are satisfied, i.e., in the class of hv-convex 
polyominoes, the reconstruction can be solved in polynomial time. Several 
methods have been presented for reconstructing hu-convex polyominoes and, 
later on, they have been improved to reconstruct also 8-connected hv-convex 
discrete sets in polynomial time (see [4] for the comparison of the algorithms' 
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complexities). In both classes of hv-convex 8- and 4-connected sets, the worst- 
case time complexity of the fastest algorithm for solving the reconstruction 
problem is of O(mn - min(m?, n?]). Surprisingly, in [3] the authors showed 
that the additional knowledge that the hv-convex 8-connected set is not 4- 
connected (i.e., it is in $2) leads to a reconstruction algorithm with worst-case 
time complexity of O(mn - min{m,n}). The aim of this section is to describe 
a generalized version of the algorithm given in [3] for reconstructing discrete 
sets of O' from its horizontal and vertical projections. 


8.3.1 Q-Convex Sets with Disjoint Components 


Consider that we want to reconstruct a Q-convex discrete set from its 
horizontal and vertical projections having the prior information that the 
set has at least two components. That is, we want to solve problem 2- 
RECONSTRUCTION(Q’). We first summarize some results from [9] and [16] 
concerning directed discrete sets, which will be important in designing the 
reconstruction algorithm. 


Theorem 1. Every hv-convez directed discrete set can be reconstructed from 
its source and its horizontal and vertical projections uniquely in O(mn) time. 


Following from the definitions directly, we can check the directedness of 

an hv-convex polyomino by investigating a single position. 
Proposition 1. Let G be an hv-convez polyomino with R(G) = (i',..., i" Y x 
U^ oJ") 

1. G is SE-directed if, and only if, gy j = 1. 

2. G is NW-directed if, and only if, ĝin jn — 1. 

3. G is SW-directed if, and only if, gy jv — 1. 

4. G is NE-directed if, and only if, ĝin j = 1. 


Our algorithm will reconstruct the Q-convex discrete set component by 
component. This can be done with the aid of the following theorem, which 
describes the relation between decomposable and Q-convex discrete sets. 


Theorem 2. Q’ C S* US**. 


Proof. (Sketch.) Let F € Q’ having components F\,...,F, with R(Fj)) = 
[sud x {7],.-.,97/} (1 = 1,...,k). Since F is non-4-connected, it has 
at least two components, i.e., k > 2. Every Q-convex set is hv-convex, too, 
therefore, any two arbitrary components of F are disjoint; thus, property (3) 
is satisfied. Without loss of generality, we can assume that 


Lea Sa <ii ism. (8.9) 
'Then, exactly one of the following cases is possible: 

1&4 S4 < ja Sja <+ SHH, oF (8.10) 

N=f2H>p2h>-- 2H 21. (8.11) 
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If for F (8.10)/(8.11) holds, then F\,...,F,-1 are NW/NE-directed and 
P5,..., Fk are SE/SW-directed, respectively. Since in both cases all the com- 
ponents are directed, therefore, by Theorem 1, property (a) is satisfied, too. 
Finally, if (8.10) holds, then F € S*; otherwise (if (8.11) holds), F € S**. 
Certainly, property (y) is satisfied, too. 














Remark 2. For further details of the proof, see [2]. 
Now, we can show how to represent a set of Q’. 


Corollary 1. Let F € Q! having components F,,...,F,. Then there are 
uniquely determined row indices 0 < iy < +--+: < ik € m and column in- 
dices 0 < jı <+++ < jk € n such that for each l= 1,..., k (k > 2), (à, ji) is 
the bottom right position of R(Fi) if F € S* and (ii, jy 141) is the bottom left 
position of R(F1) if Fe S™. 


Depending on the type of F, let us define 


u [Gn fj.) | L9 1... k— 1) if Fe S*, 
oe eae li-L.ok-i1 fres", 8) 


where 71,...,7% and ji1,...,jy denote the uniquely determined indices men- 
tioned in Corollary 1. That is, on the basis of Proposition 1, Cr consists of 
the source points of the NW-/NE-directed components F),..., Fx. .1 if F has 
type 1/2, respectively (see Fig. 8.4). The knowledge of any element of Cr is 
useful in the reconstruction of an F € Q’, as we can see on the basis of the 
following theorem. 


Theorem 3. Any F € Q' is uniquely determined by its horizontal and vertical 
projections, its type, and an arbitrary element of Cr. 


Proof. Let us suppose that F € S* and (4j, jj) € Cr is given for some | € 
(L,...,k — 1). Then (i, jj) is the source of the NW-directed component F}, 
which can be reconstructed uniquely on the basis of Theorem 1. Suppose 
that R(Fi) = (2,...,4) x {7’,..., jı}. Then, the source (i5, 4, 37,4) of the 
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Fig. 8.4. (a) A discrete set F € Q’ of type 1 and (b) a discrete set F” € Q’ of type 
2. The elements of Cr and Cr; are marked with white dots. 
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SE-directed component Fj,1 and the source (ij 1, ji-1) of the NW-directed 
component PF; ; are uniquely determined. Namely, 

ilı min(i| i >i; and h; £0}, 

jı = min{j | j > jı and vj £0}, 
i1 = max{i | i < i' and h; Z 0) , 

ji-1 = max{j | j € j' and vj # 0}. 





Again, by Theorem 1, F;+}ı and Fj_; can be reconstructed uniquely. Then the 
sources of F1,» and Fy_2 are determined. The method can be continued until 
Fi and Fy are reconstructed. The proof is similar if F € S**. 














As a direct consequence, we see from Theorem 3 that different solutions 
of 2-RECONSTRUCTION(Q’) with the same type must have different source 
points. 


Corollary 2. If F, F' € Q' are different solutions of the same reconstruction 
problem and they have the same type, then Cp N Cg. = 0. 


8.3.2 Equality Positions 


Let H and V be the cumulated vectors of F € Q'. We say that (i,j) € 
[1,...,m— 1] x {1,...,n—1} is an equality position of type 1 of F if hy = Uj. 
We say that (i,j) € {1,...,m} x {2,...,n +1} is an equality position of type 
2 of F if h; = Ùn — 9; ,. Equality positions (i, j) for which h; = 0 and/or 
vj = 0 are not interesting for this study and are omitted. We denote the set 
of equality positions of type 1/2 by LL/L2, respectively. Equality positions 
are useful for finding the elements of C. 


Lemma 1. Let F € Q'. Then Cp C LL if F € S* and Cr C LL if Fe S**. 
Proof. Let us suppose that F € S* and define a set E as follows: 
E = ({1,..., ib x {j +1,...,n}) U t+ 1,...,m} x {1,...,9}). (8.17) 


If (i,j) € Cr, then FN E = 0, and so Cr C L} since 


fiev duos EO Loc Slt | ENA {L.i} x {1} 
t=1 


=|FNA{1,.. m} x lee Gg} = So te (8.18) 


t=1 


If F € S**, then define F’ as follows: 


E' = ({1,...,¢} x (13,...,3- 1)U (Ci c Lh... mE x (,...,n]). (8.19) 
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If (i,j) € Cr, then FN E' = 0, and so Cr C LẸ since 


ha hes os bovis oc posco = He eps es oes m] 
t=1 


n 


= |FN{1,...,m} x {j-n} = ou = Ùn = Ùj (8.20) 


t=j 














8.3.3 The Reconstruction Algorithm 


The algorithm for reconstructing sets of Q’ from their horizontal and vertical 
projections can now be outlined as follows. 


Algorithm 1 
1. Identify the sets L} and LẸ; 
2. while L} 4 0 do 
F=0;l:=1; 
take an arbitrary element (i,j) € L} and let L5 = L} \ {(i, j)}; 
try to reconstruct the NW-directed component F from source (i, 7) 
with n) = (hi, ety hi) and Y(E) = (v1, e y vj); 
repeat 
if F, is not reconstructed then break; 
F:=FUF;1:=141; 
identify the source (ir, jf) of Fi and try to reconstruct the 
SE-directed component Fi from source (ij, jf) with the 
corresponding projections; 
if Fy, is reconstructed then 
if F, is the last component then output F U Fi; 
else if F, is NW-directed then Ll. = Ly \ ((à,j)); 
until F; is the last component or F; is not NW-directed; 
3. Repeat step 2 with NE- and SW-directed components using set L7; 


This algorithm works as follows. We first assume that the set F € Q’ to 
be reconstructed has type 1. On the basis of Theorem 3, it is sufficient to find 
an arbitrary element of Cr to reconstruct F from its horizontal and vertical 
projections uniquely. The elements of Cp are equality positions of type 1 on 
the basis of Lemma 1. So, in order to find all solutions of the reconstruction 
problem, we check for every element of Ll. whether it is an element of C. 
Without losing any solution, we can assume that if an investigated equality 
position (i, j) € Ll is in Cp, then it is the source of the first component F}. 
From the proof of Theorem 2, we know that this component is NW-directed. 
Now, in order to decide if (i,j) is the source of Fi, we try to reconstruct 
an hv-convex NW-directed polyomino G with source (i, j) such that H(G) = 
(hi,...,h;) and V(G) = (vi,...,vj). If there is no such polyomino, then, 
clearly, (i, j) cannot be the source of F; and we continue with the investigation 
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of the next equality position from LL. Otherwise, we can assume that F} = 
G and we try to reconstruct the remaining components iteratively. The ith 
component F; (l = 2,...) is SE-directed, so we try to reconstruct an hv-convex 
SE-directed polyomino G such that the number of its points in each row and 
column are equal to the corresponding elements of H and V, respectively. The 
source of F; must be the position (i7, ,, j7,,), where i7, and jj, , are defined 
by (8.13) and (8.14), respectively. If it is not possible to reconstruct the SE- 
directed polyomino, then, clearly, (47, ,, j/,,) cannot be the source of Fy, which 
contradicts the assumption that (i, j) is the source of F1, and we continue with 
the investigation of the next equality position from LL. Otherwise, we check 
whether the reconstructed component was the last one. This can be done by 
checking the bottom right position (i, j”) of G. If (i", j”) 4 (m,n), then G 
cannot be the last component. Then, G must be NW-directed on the basis of 
the proof of Theorem 2. This can be investigated by Proposition 1. If gi», j” A 
1, then G is not NW-directed and, clearly, F; Z G, which contradicts the 
assumption that (i, j) is the source of F1. We continue with the investigation 
of the next equality position from LL. Otherwise, that is, when ĝi j» = 1, 
we can assume that F, = G. On the basis of Corollary 2, G cannot be the 
first component of any other solution of the same type; therefore, (;", j”) can 
be eliminated from Ll. and we continue with the next iteration. If (i", j”) = 
(m,n), then F, = G and F = Fi U---U Fj. We found a solution and we 
continue with the investigation of the next equality position from L4 in order 
to find another solutions. Step 3 of the algorithm is similar. We assume that 
F has type 2 and try to build NE- and SW-directed components from the 
corresponding sources. If no solutions are found after investigating all equality 
positions of both types, then the assumption that F € Q' is not met, i.e., there 
is no discrete set with the given projections which is Q-convex and consists of 
at least two components. However, in some cases there can be several solutions 
(see Fig. 8.5). 















































































































































Fig. 8.5. Six sets of O' with the same horizontal and vertical projections. 


Example 1. Suppose that we wish reconstruct the Q-convex discrete set F in 
Fig. 8.6(a) from the vectors H and V. In this case, LL = {(1, 1), (2,3), (3, 4)}, 
L?. = {(4,2)}, and Cr = {(2,3)}. First we assume that the set to be recon- 
structed is of type 1. Supposing that (1,1) is the source of the NW-directed 
component Fı [Fig. 8.6(b)], the algorithm fails after reconstructing the SE- 
directed second component [Fig. 8.6(c)] because there is no room for F3. Then 
the position (3, 4) can be deleted from LL. Assuming that (2,3) is the source 
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of Fi [Fig. 8.6(d)] the algorithm gives a solution of type 1 [Fig. 8.6(e)]. Finally, 
assuming that the set to be reconstructed is of type 2, we suppose that (4,2) 
is the source of Fi. Since there is no room for F}, the algorithm ends (there 
is no solution of type 2). 
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Fig. 8.6. An illustration of how Algorithm 1 works. 


8.4 Reconstruction from Four Projections 


'The aim of this section is to present a general decomposition technique for 
reconstructing discrete sets from four projections. The algorithm uses similar 
ideas as the one of the previous section. Although this method works in more 
general classes of discrete sets, the price we pay is that we need four projections 
and some more technical lemmas. Due to space considerations, the technical 
proofs of the theoretical results in this chapter are omitted. The interested 
reader can find them in [1]. 


8.4.1 The Center of a Decomposable Discrete Set 
We can give a description of decomposability as follows: 


Lemma 2. A discrete set F is decomposable if, and only if, it satisfies prop- 
erty (a) and there exists a sequence of discrete sets po... pU) (k > 2) such 
that F consists of one component, F(? = F, and for each l= 1,..., k — 1, 
we get porn by gluing a component to FO using a gluing operator. 


For a given discrete set F € DEC, the sequence described in Lemma 2 is not 
uniquely determined. We will refer to any sequence satisfying the properties 
of Lemma 2 as a gluing sequence of F. For example, the sequences F(? = 
1,0 F°) = {(1, 1), (2,2)}, a) = {(2,2)}, and Go = {(1,1), (2,2)} 
are both gluing sequences of the same discrete set F = {(1, 1), (2,2)}. Clearly, 
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every gluing sequence of a given decomposable discrete set must be of the same 
length (namely, the length is always equal to the number of the components 
of the discrete set). If the discrete set F € DEC consists of two components, 
then, clearly, either F € S* or F € S**. Moreover, from property (y) it 
follows that if F,...,F is a gluing sequence of F € DEC V (S* U S**), 
then there exists a unique integer 1 < j < k such that FO) € S* U S** 
and FG*U g S* U S**. The following lemma shows the relation between two 
arbitrary gluing sequences of a same discrete set. 


Lemma 3. Let F™,...,F™ and GOO,..., GU). be two different gluing se- 
quences of the same discrete set F € DEC \ (S* U S**). Moreover, let 
1 «€ j,j' « k such that FD), GG? € S*US**, and FOD, Got) g S*US*™. 
Then j = j' and F) = GO’, 


Based on this lemma we can say that, for every set F € D£C, there exists 
an integer j such that, in every gluing sequence F(),..., F9 = F, FY) is 
the same, FY) € S* U S**, and FÜtU g S* US** (if F € S* US**, then 
j — k and this latter relation is not important). The uniquely determined set 
FO) is called the center of F and is denoted by C(F). For example, if the 
configuration of the components of the set F is given as in Fig. 8.3(a), then 
C(F) = F> U Fs, while in the case given in Fig. 8.3(b), C(F) = Fy U F; U F4. 


8.4.2 Finding a Component 


Before giving any further details, we must mention one important fact accord- 
ing to property (a). In fact, to satisfy this property, we usually need to have 
some a priori information about the components. For example, one can assume 
that the components are NW-directed and hv-convex since in this case prop- 
erty (o) is fulfilled (see Theorem 1). In the following, where it is important to 
emphasize this in order to avoid confusion, we will write DECe for the class 
of decomposable discrete sets whose components are from a certain class C 
where property (a) holds. On the basis of properties (o) and (8) in the recon- 
struction of a decomposable discrete set, it is sufficient to identify the SCDRs 
of the components. In this subsection we only deal with NW-components. 
However, the results given in the following can easily be modified to find NE-, 
SE-, or SW-components, too. In order to find the SCDR of a NW-component, 
we first give a necessary condition that is quite similar to Lemma 1. 


Lemma 4. Let F € DEC. If (i,j) is the bottom right position of the SCDR of 
the NW-component of F, then i is the smallest integer for which there exists 
an integer j with hy = 0; = ài, 1 > 0 and ai, — 0. 


Unfortunately, this lemma does not give a sufficient condition for iden- 
tifying the SCDR of a component of F. For example, if the discrete set is 
F = ((1,3), (2,2), (5,1)}, then for the position (2, 2) the conditions of Lemma 
4 hold (since ho = Ù = G3 = 2 and a4 = 0) but F has no NW-component. 
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With the aid of the following theorem, it is possible to test whether the de- 
composable discrete set has a NW- or SE-component. 


Theorem 4. Let F € D£Cc, H(F) = (hi,..., hm), V(F) = (vi,..., Un), 
and A(F’) = (a1,.-.,@m+n—1)- If (i, j) is a position satisfying the necessary 
conditions of Lemma 4 such that a polyomino P € C exists with H(P) = 
(hi, nr hi), V(P) = (v1, . sa Ui); and A(P) = (a1, oa Qiii); then P is 
the NW-component of F or/and F has a SE-component. If no such position 
exists, then F has no NW-component. 


Note that if the conditions of the above theorem hold, then in some cases 
the discrete set can have both NW- and SE-components; for example, if the 
discrete set is in S*. However, Theorem 4 does not state that the discrete set 
necessarily has a NW-component (see, e.g., Fig. 8.7). 
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Fig. 8.7. (a) A decomposable discrete set that has no NW-component although the 
position (5, 4) satisfies the conditions of Theorem 4 with the polyomino in (b). 


Remark 3. 'The polyomino in Fig. 8.7(b) is not uniquely determined by its hor- 
izontal and vertical projections. Its symmetrical pair has the same projections. 
However, using the prior information that the set to be reconstructed can 
consist of rectangle-shaped components or polyominoes shown in Fig. 8.7(b), 
property (a) can be satisfied. 


If the set is of S*/S**, then with the aid of the NW/NE-version of Theorem 
4, it is possible to find the SCDR of the NW/NE-component of F, respectively. 
This means that once we have decomposed all the components around the 
center of F, Theorem 4 gives an effective tool to reconstruct the center itself. 
On the basis of the following theorem, one can find the NW-component of F 
(if it exists) if F € DEC \ (S* US**). 


Theorem 5. Assume that F € DEC \ (S* U S**) and (i,j) satisfies the 
conditions of Theorem 4 with a polyomino P. Moreover, let R(C(F)) — 
{i1,...,42} x (41... j2}. Then P is the NW-component of F if, and only 
if, there exists i' € {i1,..., i2} such that à < i! or there exists j' € (31,..., ja] 
such that j <j’. 


An observation similar to Corollary 2 will speed up the reconstruction. 
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Corollary 3. If F, F' € DEC are different solutions of the same reconstruc- 
tion problem, then R(C(F)) and R(C(F")) are disjoint. 


8.4.3 The Reconstruction Algorithm 


Now, an algorithm can be outlined for reconstructing decomposable discrete 
sets with given horizontal, vertical, diagonal, and antidiagonal projections. We 
first describe a procedure for decomposing a NW-component knowing that it 
belongs to a given class C where property (o) holds. 


Procedure 1 DecomposeNW 


1. Find the position (i, j) for which hy = Ùj = Q4j-1 > 0 and ayy; = 0; 
if no such position exists then return(no component); 
2. if j > l then return(no component); 
3. Construct a polyomino P € C with H(P) = (hi... hi), 
V(P) = (v1, n .,Uj), and A(P) = (a1, Sais (Oir 1) 
if no such polyomino exists then return(no component); 
.F:-FUP; 
. Update H, V, D, and A according to the projections of P; 
6. return; 


QU a 


'The procedures for decomposing components from the other three direc- 
tions can be outlined similarly. The main algorithm for reconstructing decom- 
posable discrete sets produces four vectors H € NP, V € N}, D e Ntt, 
and A € Ny as input, and outputs the set S containing all the de- 
composable discrete sets with the projections H, V, D, and A such that the 
components are from a given class C where property (o) holds. This algorithm 
reconstructs the solutions component by component, calling the procedures 
for decomposition. The algorithm can be outlined as follows: 


Algorithm 2 
1. C := {1,... n} \ es Oe S := 0; 
2. F := 0; l := min C; restore H,V, D, and A; 
3. repeat 
call DecomposeN W; 
if (no component) then call Decompose NE; 
if (no component) then call DecomposeSE; 
if (no component) then call DecomposeS W; 
until (no component); 
. Try to reconstruct the last component; 
5. if D=0 and A=0 then ( S:= SU {F}; 
C := CN {columns of C(F)}; } 
6. if C = Í then return S else goto Step 2; 


A 
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This algorithm works as follows. First, the set of solutions is S = 0, and 
the set C contains the possible columns of a solution’s center (step 1). Then in 
step 2, we set F = (), we restore the vectors H, V, D, and A to their original 
values, and we try to find a solution such that the leftmost column of its center 
is | = min C. In step 3, we first try to decompose a NW-component. This is 
done by Procedure 1, which seeks the SCDR of the NW-component of F by 
trying to find the uniquely determined position that satisfies the conditions of 
Theorems 4 and 5 with a polyomino P (steps 1-3 of Procedure 1). If no such 
position exists, then either the assumption that the set to be reconstructed 
has a NW-component was false or the /th column cannot be a column of the 
center. In both cases, the procedure simply returns. Otherwise, the polyomino 
P is assumed to be the NW-component of F; we simulate the effect of this 
component on the projections of F (steps 4 and 5 of Procedure 1) and we 
return to the main algorithm. If we were not able to find a NW-component, 
then we try to decompose a NE-, SE-, and SW-component, similarly, in this 
order. Note that Theorem 5 cannot be applied on sets of F € S* U S**, 
but this does not lead to failing our algorithm since in this case instead of 
a NW/NE-component we will decompose a SE/SW-component if the set is 
of type 1/2, respectively. If we found a component (which can be any of the 
four kinds), then we go on and try to decompose further components from 
the remaining set. We repeat this until we cannot decompose a component. 
In this case, the remaining set is nondecomposable; therefore, it must consist 
of a single component. We try to reconstruct this last component in step 
4. Then in step 5 we check whether the reconstructed set F has the given 
projections; if so, then we set S :— SU {F}. On the basis of Corollary 3, 
columns of C(F) cannot be the columns of any other solution; therefore, we 
eliminate the columns of C(F) from C and we go on to step 2 in order to find 
possible further solutions with other centers. Algorithm 2 investigates in each 
iteration whether a solution with a certain center exists by assuming that a 
given column is a column of the center, too. On the basis of Theorem 5, this 
strategy can also be applied to the rows of the center. If m « n, then we will 
choose the latter version of our strategy. 


Example 2. Suppose that we wish to reconstruct discrete sets of DECe (where 
C is the same class as mentioned in Remark 3) from the vectors 

H = (1,2,2,2,1,1,2,2,2,1,1) , 

V = (1,3,3,1,0,1,3,3,1,1), 

D > (0,0, 0,0, 0,0, 1,2,2,3,4, 2, 2,1, 0,0, 0, 0, 0, 0) , 

A = (0,0,1,3,3,1,0,0,0,0,0,0,1,3,3,1,0,0,0, 1). (8.21) 
First, it is assumed that the first column is a column of the center; therefore, 
the position (i, j) that satisfies h; = 0; = @4;-1 > 0 and aj, = 0 violates 
Theorem 5 [Fig. 8.8(a)]. Then the algorithm reconstructs SW-components 


in two iterations [Figs. 8.8(b) and 8.8(c)]. After that, in the next two itera- 
tions, the algorithm decomposes two NE-components in the remaining part 
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(k) (1) (m) (n) (0) 


Fig. 8.8. An example how Algorithm 2 works. The identified discrete rectangles 
which must contain the next component are marked with bold squares. The column 
assumed to be a column of C(F) is filled with slanted lines. 


of the discrete set [Figs. 8.8(d) and (e)]. Finally, the algorithm reconstructs 
the last component and finds a solution F [Fig. 8.8(f)]. After eliminating 
the columns of C(F) from C, we go on by assuming that the sixth column 
is a column of a possible other solution’s center. With this assumption, we 
decompose a NW-component [Fig. 8.8(g)]. Then, again, a NW-component 
cannot be decomposed in the remaining part as its SCDR violates Theorem 5 
[((Fig. 8.8(h)]. In the following, we decompose a SE-, a NE- and, again, a 
NE-component [Fig. 8.8(i),(j), and (k), respectively]. Reconstructing the last 
component [Fig. 8.8(1)], we find another solution F”. Finally, assuming that 
the last column is the center, we decompose two NW-components [Figs. 8.8(m) 
and (n)] and reconstruct the last component [Fig. 8.8(0)]. However, the re- 
constructed set does not have the given diagonal projection; therefore, the 
algorithm ends and the number of solutions is two. 
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8.5 Correctness and Complexity 


In this section we prove the correctness and state the complexity of the al- 
gorithms of the previous two sections. First, we study the complexity of the 
reconstruction using only two projections. 


Theorem 6. Algorithm 1 solves problem 2-RECONSTRUCTION(Q’) in O(mn- 
min(m, n)) time. The algorithm finds all sets of Q’ with the given projections. 


Proof. Every row and column index can be in an equality position of both 
types at most once. This means that we have at most min{m,n} equality 
positions of type 1 and at most min(m, n) equality positions of type 2. More- 
over, equality positions can be found in time O(m + n) by the comparison 
of the cumulated horizontal and vertical vectors. Building the components of 
F assuming that an equality position (i,j) is in Cr takes O(mn) time (see 
Theorem 1). We have to examine every equality position if it is in Cr, so we 
get the execution time O(mn : min(m, n]) in the worst case. 

On the basis of Theorems 2 and 3, the reconstructed sets are Q-convex and 
have the given projections H and V. On the basis of Theorem 3, any element 
of Cr together with the projections and the knowledge of the type of F is 
sufficient to reconstruct F uniquely. Elements of Cr are equality positions, 
too, on the basis of Lemma 1. Algorithm 1 examines every equality position 
whether it is in Cp, and so the second part of the theorem follows. 














Sets of S; have the same properties as sets of Q’ (see [3] for further de- 
tails). The only difference is that the SCDRs of the components in the class 
O' might be separated (i.e., there can be empty rows and/or columns be- 
tween two consecutive components) while in the class S they are always 8- 
connected [see also Fig. 8.2(c) for an example]. This observation immediately 
implies that 2-RECONSTRUCTION(S;) can also be solved in O(mn-min{m, n}) 
time. However, there is an interesting improvement in the complexity of the 
reconstruction if the set of O' to be reconstructed is not 8-connected. 


Theorem 7. 2-RECONSTRUCTION(Q'\S{) can be solved in O(mn) time. The 
number of solutions is at most two. 


Proof. Let F € Q'\S%. Since F is not 8-connected, there exists at least one 
empty row in R(F) (say the ith) such that h; 4 0 or an empty column in 
R(F) (say the jth) such that v; # 0. Assume that the ith row of R(F) is 
empty and h; #0. Let i* = max(i' | de = hi and hy z 0] and 


(8.22) 


a J min(j' | oj = Àj) if fS 
m max(j' | Un — Ùj = hi) ifFes"*. 


Note that (i*, j*) can be found in O(m 4- n) time. Since F is Q-convex, the 
position (i*, j*) must be the source of a NW/NE-directed component if F is of 
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type 1/2, respectively (see Fig. 8.9). Therefore (i*, j*) € Cr and the solution 
is uniquely determined on the basis of Theorem 3. Consequently, there can be 
ambiguity only in the type of F, i.e., the number of solutions is at most two. 
In some cases, there exist solutions of both types (see Fig. 8.9). The proof can 
be given analogously if the SCDR of the discrete sets F € S* U S** has an 
empty column. 






















































































Fig. 8.9. Two sets of Q’ with an empty row and column and the same horizontal 
and vertical projections. 


Now we prove the correctness of the algorithm that uses four projections. 


Theorem 8. Let C be an arbitrary class of polyominoes that can be recon- 
structed in this class uniquely from their horizontal and vertical projections 
in. polynomial time (say in O(f(m,n)) time). Then Algorithm 2 solves 4- 
RECONSTRUCTION(DECc) in O(min(m,n] - f(m,n)) time. The algorithm 
finds all sets of DECe with the given projections. 


Proof. Let F be an arbitrary set of S. From Theorems 4 and 5, it follows 
that F is decomposable. Due to step 3 of Procedure 1, the horizontal and 
vertical projections of F are equal to the given vectors H and V, respectively. 
Moreover, step 5 of Algorithm 2 guarantees that the diagonal and antidiag- 
onal projections of F are also equal to the vectors D and A, respectively. 
Assuming that the lth (l = 1,..., k) component to be reconstructed is a NW- 
component, it takes O(m 4- n) time to find the (uniquely determined) position 
that satisfies the necessary conditions of Lemma 4. We do it simply by scan- 
ning the vectors H, V, and A. In order to test whether this position is the 
bottom right position of the SCDR of the NW-component, we try to recon- 
struct this component based on Theorem 4, which takes O( fı) time. The same 
is true if the lth component is a NE-, SE- or SW-component. In the worst 
case, the component is a SW-component, i.e., we try to reconstruct the lth 
component at most four times, and so the time complexity of reconstructing 
all the components (steps 3 and 4 of Algorithm 2) is of O( f (m, n)), which is 
polynomial on the basis of property (a). In the worst case, we iterate steps 3 
and 4 of Algorithm 2 by assuming that the first, second, ..., nth columns of 
G are also columns of C(G) (if n « m) or by assuming that the first, second, 
.., mth rows of G are also rows of C(G) (if m < n). This means that steps 
3 and 4 of Algorithm 2 must be repeated at most min{m,n} times. So we 
get that the total execution time of Algorithm 2 is of O(min(m, n] - f (m, n)). 
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Since we check every possible column (row) of F whether it is a column (row) 
of C(F), it follows that the algorithm reconstructs all sets of DEC with the 
given projections. Consequently, if the algorithm returns S = (), then there is 
no solution of 4-RECONSTRUCTION(DEC). 














If a discrete set is of type 1/2, then, following from Corollary 3, the solution 
is uniquely determined. With the aid of Theorem 4, this solution can be 
found with a single iteration of steps 3 and 4 of Algorithm 2. Knowing that 
SZC Q'CS*US**, this observation leads to the following corollaries. 


Corollary 4. 4-RECONSTRUCTION(Q’) can be solved in O(mn) time. The 
solution is uniquely determined. 


Corollary 5. 4-RECONSTRUCTION(S{) can be solved in O(mn) time. The 
solution is uniquely determined. 


Remark 4. (a) If a discrete set is of type 1, then the horizontal, vertical, and 
antidiagonal projections are sufficient for the reconstruction. Similarly, 
for reconstructing a set of type 2, it is sufficient to know its horizontal, 
vertical, and diagonal projections. That is, knowing the type of the discrete 
set, the reconstruction can be solved even from three projections uniquely. 

(b) The class of decomposable discrete sets for which the NW/NE/SE/SW- 
version of Theorem 4 gives both necessary and sufficient conditions for 
the existence of a NW/NE/SE/SW-component, respectively, is somewhat 
broader than S* US**. This class of sets is called strongly decomposable 
and was studied in [2]. 

(c) Although the reconstruction of hv-convex discrete sets is, in general, an 
NP-hard problem [18], the decomposition technique described in this sec- 
tion can be applied to a subclass of hv-convex discrete sets yielding a 
polynomial-time reconstruction algorithm [1]. 


8.6 Conclusions and Discussion 


In this chapter we studied the reconstruction complexity in some well-known 
classes of discrete sets using the additional knowledge that the discrete set 
consists of several (disjoint) components. We learned that in two important 
classes, namely in the class of hv-convex 8-connected discrete sets and in the 
class of Q-convex discrete sets, using the extra information that the set con- 
sists of at least two components (i.e., it is in S$, and Q’, respectively), the 
reconstruction problems can be solved from two projections quite effectively. 
Moreover, we got the interesting result that the reconstruction from two pro- 
jections in the class of non-8-connected Q-convexes can be even faster. In 
the case of four projections, we investigated the general class of decompos- 
ables and presented a polynomial-time algorithm for reconstructing discrete 
sets of this class from four projections. All presented algorithms are able to 
find all the solutions for a given reconstruction problem. The complexity of 
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reconstructing decomposable discrete sets from two projections is an open 
question. What we do know is that in some cases there can be exponentially 
large numbers of decomposable discrete sets with the same horizontal and 
vertical projections. Therefore, if an algorithm that reconstructs all the solu- 
tions of such a reconstruction task exists, then for some instances the time 
complexity of the algorithm is exponential. However, it might be possible that 
finding only one solution can be done in polynomial time. 
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Network Flow Algorithms for Discrete 
Tomography 


K.J. Batenburg 


Summary. There exists an elegant correspondence between the problem of recon- 
structing a 0-1 lattice image from two of its projections and the problem of finding 
a maximum flow in a certain graph. In this chapter we describe how network flow 
algorithms can be used to solve a variety of problems from discrete tomography. 
First, we describe the network flow approach for two projections and several of its 
generalizations. Subsequently, we present an algorithm for reconstructing 0-1 images 
from more than two projections. The approach is extended to the reconstruction of 
3D images and images that do not have an intrinsic lattice structure. 


9.1 Introduction 


The problem of reconstructing a 0-1 image from a small number of its projec- 
tions has been studied extensively by many authors. Most results deal with 
images that are defined on a lattice, usually a subset of Z?. Already in 1957, 
Ryser studied the problem of reconstructing an mxn 0-1-matrix from its row 
and column sums [19, 20]. He also provided an algorithm for finding a recon- 
struction if it exists. Ryser's algorithm is extremely efficient. In fact, it can 
be implemented in linear time, O(m + n), by using a compact representation 
for the output image [5]. 

The problem of reconstructing a 0-1 matrix from its row and column sums 
can also be modeled elegantly as a network flow problem. In 1957, Gale was the 
first to describe the two-projection reconstruction problem in the context of 
flows in networks, providing a completely different view from Ryser's approach 
[7]. In the latter work, there was no reference to the algorithmic techniques for 
solving network flow problems. In 1956, Ford and Fulkerson published their 
seminal paper on an algorithm for computing a maximum flow in a network [6], 
which can be used to solve the two-projection reconstruction problem. Using 
the network flow model, Anstee derived several mathematical properties of 
the reconstruction problem [2]. 

'The reconstruction problem from two projections is usually severely un- 
derdetermined. The number of solutions can be exponential in the size of the 
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image. In practice, the goal of tomography is usually to obtain a reconstruction 
of an unknown original image, not just to find any solution that has the given 
projections. If only two projections are available, additional prior knowledge 
must be used. Certain types of prior knowledge can be incorporated efficiently 
into the network flow approach, by using the concept of min cost flows. 

A drawback of the network flow approach is that it cannot be generalized to 
the case of more than two projections. The reconstruction problem is NP-hard 
for any set of more than two projections [8]. Recently, an iterative approach 
for reconstructing 0-1 images from more than two projections was proposed 
by Batenburg [3]. In each iteration a reconstruction is computed from only 
two projections, using the network flow approach. The reconstruction from the 
previous iteration, which was computed using a different pair of projections, 
is used as prior knowledge such that the new reconstruction resembles the 
previous one. 

In this chapter the network flow approach will be described, starting with 
the basic two-projection case. Section 9.2 describes the basic network flow for- 
mulation. In Section 9.3, the model is extended to incorporate prior knowledge 
in the reconstruction procedure. Section 9.4 deals with how the network flow 
approach can be made tolerant to noise and other errors. The implementation 
of network flow algorithms for discrete tomography is discussed in Section 9.5. 
Several highly efficient implementations of network flow algorithms are avail- 
able. This section also addresses the time complexity of the relevant network 
flow algorithms. The basic iterative algorithm for reconstructing from more 
than two projections is described in Section 9.6. This algorithm can be gener- 
alized to 3D reconstruction very efficiently, which is discussed in Section 9.7. 
So far, all sections deal with lattice images. In Section 9.8, we discuss how 
the algorithms from the previous sections can be adapted to the problem of 
reconstructing binary images that do not have a lattice structure. 


9.2 Network Flow Formulation for Two Projections 


The reconstruction problems of this paper can be posed in several different 
forms. We mainly consider the reconstruction of a subset F of Z? from its 
projections, but one can also formulate this problem in the context of recon- 
structing binary matrices or black-and-white images. In the case of binary 
matrices, the set F is represented by the set of matrix entries that have a 
value of 1. If we want to display a set F C Z? and F is contained in a large 
rectangle A C Z? (e.g., 256? elements), it is convenient to represent F as 
a black-and-white image. The white pixels correspond to the elements of F; 
the black pixels correspond to the remaining elements of A. Continuous to- 
mography algorithms, such as the algebraic reconstruction technique (see, e.g., 
Chapter 7 of [17]), represent the reconstruction as a gray-level image. At sev- 
eral points in this chapter, we discuss how to utilize algorithms for continuous 
tomography for solving the discrete reconstruction problems. In these cases 
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we use the black-and-white image representation of F, as this representation 
can easily be connected with the gray-level images from continuous tomogra- 
phy. Depending on the representation of the set F, points in A may also be 
called entries (in the context of binary matrices) or pixels (in the context of 
black-and-white images). 

In this section we consider the problem of reconstructing a subset F of the 
lattice Z? from its projections in two lattice directions, v(? and v. This is a 
generalization of the problem of reconstructing a binary matrix from its row 
and column sums. 

We assume that a finite set A C Z? is given such that F C A. We call 
the set A the reconstruction lattice. As an illustration of the concept of the 
reconstruction lattice, consider the representation of F as a black-and-white 
image. The set A defines the boundaries of the image: All white pixels are 
known to be within these boundaries. 

We denote the cardinality of any finite set F by |F|. Define No = (x € 
Z |x 20). Let v(0,v(2 € Z2. A lattice line is a line in Z? parallel to either 
v or v) that passes through at least one pani in Z?. Any lattice line 
parallel to v) (k = l ,2) is a set of the form (nv? +t | n € Z} for t € Z?. 
The sets £C) and £°?) denote the sets of lattice lines for directions vU and 
v2 respectively. For k = 1,2, put L® = (£ e L® | £n A F Ø}. Note 
that L and L®) are finite sets. We denote the elements of L? by £p; for 
i=1,...,|L|. As an example, Fig. 9.1 shows the reconstruction lattice for 
A= {1,2,3} x {1,2,3}, vY = (1,0), and v® = (1,1). For this example, the 
sets L® and L) contain three and five lattice lines, respectively. 

For any lattice set F C Z2, its projection p(9 : L(9 — No in direction 


v(9 is defined as j 
PO (0) 2 |Fn 4 2 M7 f(a) (9.1) 
cel 


where f denotes the characteristic function of F. The reconstruction problem 
can now be formulated as follows: 


Problem 1. Let uv“), v? be given distinct lattice directions, and let A C Z? 
be a given lattice set. Let p® : L(U — No and p? : L(? — No be given 
functions. Construct a set F C A such that po — p(? and pe = p®), 


Define S9 = ee Lh) p? (£). We call S the projection sum for direction 
v(?), Note that if F is a solution of Problem 1, we have S? = |F| for k = 
1,2. In Section 9.4, a generalization of Problem 1 will be described for which 
the prescribed projections p! and p(2 may contain errors. In that case the 
projection sum for direction v may be different from the projection sum for 
direction v2. 

With the triple (A, v(U, v2), we associate a directed graph G = (V, E), 
where V is the set of nodes and E is the set of edges. We call G the associated 
graph of (A, v (0 , VQ). 
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The set V contains a node s (the source), a node t (the sink), one node 
for each £ € L(U, and one node for each £ € L). The node that corresponds 
to £j; has label ni ;. We call the nodes nz; line nodes. 

Nodes n4, and nz j are connected by a (directed) edge (n1,;,n2,;) if, and 
only if, £4,; and £5,; intersect in A. We call these edges point edges and denote 
the set of all point edges by Ep € E. There is a bijective mapping 9 : Ep — A 
that maps (ni,;,n2,j) € Ep to the intersection point of /,.; and /».;. We call 
$ the edge-to-point mapping of G. For e € Ep, we call $(e) the corresponding 
point of e and for x € A, we call ®-1(x) the corresponding edge of x. 

Besides the point edges, the set E contains the subsets E1 = ((s,n15;) | i = 
1,...,|L|} and Ey = ((na,,£) |j = 1,...,|L? | of directed edges. We call 
the elements of E, and Es the line edges of G. The complete set of edges of 
G is given by E = Ep U E, U E». Figure 9.2 shows the associated graph for 
the triple (A, uv, v)) of Fig. 9.1. 


x=0 
a 
y =3———+—> e e e 
y=2 e e e 
x-y=-2 
y-l e. e. e. 
x-y=-1l 





y=0 








x-y=0 x-y=1 x-y=2 


Fig. 9.1. Example lattice: A = {1, 2,3} x {1, 2,3}, v = (1,0), v® = (1,1). 


Note that the structure of the associated graph is independent of the pro- 
jections p and p). To use the associated graph G for solving a particular 
instance of the reconstruction problem, we assign capacities to the edges of 
G. A capacity function for G is a mapping E — No. We use the following 
capacity function U: 


for i=1,...,|L™|; j =1,...,|L}; (9.2) 
U((m,,no;) = 1; (9.3) 
U((ssm) = p(s); (9.4) 
Ulni t) = pP). (9.5) 


A flow in G is a mapping Y : E — Ro such that Y (e) < U(e) for all 
e € E and such that for all v € V\{s,t}, 


3 Y(wv)- M, Y((ww). (9.6) 


w:(w,v)eE w: (v,w)EE 
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Fig. 9.2. Associated graph G for the triple (A, yy) from Fig. 9.1. 


The latter constraint is called the flow conservation constraint. Flows in 
graphs are also known as network flows in the literature. Let Y be the set 
of all flows in G. For a given flow Y € y, the size T(Y) of Y is given by 
T(Y) = 5^. vyeg Y (8, v)). If we consider G as a network of pipelines, carry- 
ing flow from s to t, the size of a flow is the net amount of flow that passes 
through the network. Due to the flow conservation constraint, we also have 
T(Y) = $w npes Y ((v. t)). The associated graph G has a layered structure: 
All flow that leaves the source s must pass through the point edges. This 
yields the equality T(Y) = Diem, Y (e). If Y (e) € No for all e € E, we call Y 
an integral flow. Note that for any integral flow Y in the associated graph G, 
we have Y (e) € {0,1} for all e € Ep, as the capacity of all point edges is 1. 

There is an elegant correspondence between the solutions of the recon- 
struction problem and the integral flows of maximal size (maz flows) in the 
associated graph G: 


Theorem 1. Suppose that S® = S@) =: T. Problem 1 has a solution if, and 
only if, there exists an integral flow in G of size T. Moreover, there is a 1-1 
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correspondence between the solutions of Problem 1 and the integral flows of 
size T in G. 


Proof. We show first that any integral flow in G of size T corresponds to a 
unique solution of Problem 1. Let Y be a flow in G of size T. For each e € Ep, 
we have Y (e) € {0,1}. Put Fy = {8(e) | e € Ep and Y (e) = 1}, where ® 
is the edge-to-point mapping of G. The set Fy contains all lattice points for 
which the corresponding point edge in G carries a flow of 1. We call Fy the 
corresponding point set of Y. We claim that Fy is a solution of Problem 1. We 
show that PË = p(?; the proof for direction v(2 is completely analogous. 

From the capacity constraints on the line edges of G and the fact that 
T(Y) = S), it follows that all line edges of G must be filled completely by 
Y. Therefore, we have Y ((s,n1,,)) = p® (€,,,) for alli = 1,...,|L(? |. Because 
of the flow conservation constraint at the line nodes of G, we have 


[Z| 
XO ¥((nai,n23)) = pO (4a) for i=1,...,|L%| (9.7) 


j=l 
and, therefore, 


HEC 72,5) |Y (nrin) 21] =p Ca) fori 1,....|20?]. 
(9.8) 
From the structure of G, it follows that 


Fy N bia = {P((n1 i 2,5) | Y((nion2)) =1}, (9.9) 


which yields PO (4) = p™® (4) for i = 1,...,|L(?|. To prove that every 
flow Y of size T in G corresponds to a unique solution of Problem 1, we 
note that Y is completely determined by its values on the point edges of G. 
Therefore, a flow Y’ Z Y of size T must be different from Y at at least one 
of the point edges; hence, Fy; Z Fy. 

We will now show that the mapping from flows of size T in G to solu- 
tions of Problem 1 is surjective. For any solution F of Problem 1, define the 
corresponding flow Yr: 


Yg((nii,n2,)) = (9.10) 


1 if S((ni;,na,j)) EF, 
0 otherwise. 
Specifying Yr on the point edges completely determines the flow through 
the remaining edges by the conservation of flow constraint. We call Yr the 
corresponding flow of F. It is easy to verify that Yr satisfies all edge capacity 
constraints. By definition, F is the corresponding point set of Yr. We have 
T(Yr) = Dw wer, Y ((v. w)) = |F], so Yr is a flow of size |F| = s® =T. 
This shows that the mapping Y — Fy is a bijection. 
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The proof of Theorem 1 shows that we can find a solution of Problem 1 by 
finding an integral flow of size T = S® = S? in the associated graph. This 
flow is a maximum flow in G, because all line edges are completely saturated. 
Finding a maximum integral flow in a graph is an important problem in 
operations research, and efficient algorithms have been developed to solve 
this problem; see Section 9.5. 

The equivalence between the reconstruction problem for two projections 
and the problem of finding a maximum flow in the associated graph was 
already described by Gale in 1957 [7] in the context of reconstructing binary 
matrices from their row and column sums. Theorem 1 generalizes this result 
to the case of any reconstruction lattice A and any pair of lattice directions 
(uv, Q2), 

In the next sections we will see that the network flow approach can be 
extended to solve more complex variants of the reconstruction problem and 
that it can be used as a building block for algorithms that compute a recon- 
struction from more than two projections. 


9.3 Weighted Reconstruction 


Problem 1 is usually severely underdetermined: The number of solutions can 
be exponential in the size of the reconstruction lattice A. In practical applica- 
tions of tomography, the projection data are usually obtained by measuring 
the projections of an unknown object (the original object), and it is important 
that the reconstruction closely resembles this object. One way to achieve this 
is to use prior knowledge of the original object in the reconstruction algorithm. 
One of the first attempts to incorporate prior knowledge in the network flow 
approach was described in [22], in the context of medical image reconstruction. 
In this section we consider a weighted version of Problem 1: 


Problem 2. Let A, v(0, v), p™, p® be given as in Problem 1. Let 
W :A—R bea given mapping, the weight map. Construct a set F C A such 
that po = p and po = p) and the total weight Y „ep W (x) is maximal. 


As a shorthand notation, we refer to the total weight of F as W(F). 
Problem 2 is a generalization of Problem 1. Through the weight map, one can 
express a preference for a particular solution if the reconstruction problem 
has more than one solution. This preference is specified independently for 
each x € A. The higher the weight W(x), the stronger is the preference to 
include x in the reconstruction F. Note that a preference for image features 
that involve several pixels cannot be specified directly through the weight 
map. 

The associated graph G can also be used to solve the weighted version of 
the reconstruction problem. Define the mapping C : E — R as follows: 
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see E is fore € Ep, 


9.11 
0, otherwise . ( ) 


The cost C(Y) of a flow Y in G is defined as »5,c g C(e)Y (e). The min cost 
flow problem in G deals with finding an integral flow Y of a prescribed size 
T in G such that the cost C(Y) is minimal. If we choose T = S(0 = S2, 
any integral flow Y of size T is a maximum flow in G and corresponds to a 
solution of Problem 1. The total weight of the solution that corresponds to a 
flow Y equals -C(Y) = W (Fy). Therefore, solving the integral min cost flow 
problem in G yields a solution of the reconstruction problem of maximum 
weight, solving Problem 2. 

Just as for the max flow problem, efficient algorithms are available for 
solving the (integral) min cost flow problem. However, most such algorithms 
assume that the edge costs are integer values. If the edge costs are all in Q, 
we can simply multiply all edge costs by the smallest common multiple of 
the denominators to obtain integer costs. If the edge costs are not in Q, the 
solution of Problem 2 can be approximated by multiplying all edge costs with 
a large integer and rounding the resulting costs. 

In [22] Slump and Gerbrands described an application of Problem 2 to the 
reconstruction of the left ventricle of the heart from two orthogonal angio- 
graphic projections. They used a min cost flow approach to solve a specific 
instance of Problem 2. 

Having the ability to solve Problem 2 can be very helpful in solving a 
variety of reconstruction problems. We will describe two such problems. These 
problems deal with the reconstruction of binary images, i.e., images for which 
all pixels are either black or white. Each pixel in the image corresponds to a 
lattice point. A binary image corresponds to the lattice set F C A, where F 
contains the lattice points of all white pixels in the image. 


Example 1. As an application of Problem 2, consider an industrial production 
line, where a large amount of similar objects has to be produced. Suppose 
that a blueprint is available, which specifies what the objects should look like. 
Occasionally, flaws occur in the production process, resulting in objects that 
don't match the blueprint. To check for errors, the factory uses a tomographic 
scanner that scans the objects in two directions: horizontal and vertical. To 
obtain a meaningful reconstruction from only two projections, the blueprint is 
used as a model image. For each object on the factory line, the reconstruction 
is computed that matches the blueprint in as many points as possible. 

'This problem can be formulated in the context of Problem 2. Suppose we 
want to reconstruct an nxn image. Put A = {1,...,n} x {1,...,n}, v® = 
(1,0), and v(O = (0,1). Let Fry be the lattice set that corresponds to the 
blueprint. We want to compute the solution F of Problem 1 such that 


(G^ n Fu) U (A\F N A\ Fu) = |A] - |F A Ful (9.12) 


is maximal, where A denotes the symmetric set difference. The term F n Fm 
represents the white pixels shared by F and Fy; the term A\F N ANFy 


9 Network Flow Algorithms for Discrete Tomography 183 


represents the shared black pixels. To formulate this problem as an instance 
of Problem 2, put 


Wiss 1 ifxe Fy, (9.13) 
~ |0 otherwise . ` 


The solution of Problem 2 for this weight map maximizes |FA Fm], the number 
of common elements of F and Fm, subject to the constraints po = p(? and 
PE) = p®, 

For the symmetric difference F A Fy, the following equality holds: 


|F A Ful = (|F| - |F A Ful) +(Ful-|FO Ful). — (914) 


Noting that |F| = S$“ is constant for all solutions of Problem 1 yields 


|(F N Fm) U (A\F n AVE) (9.15) 

=|A| - (S? - |F N Ful) - (Ful - |F N Ful) (9.16) 
=2|F n Fu| + (|A| —|Fu|—S®). (9.17) 

The term (|A| — |Fm| — S“) is constant, which shows that maximizing 


|(F à Fm) U (A\F N AVFEy)| is equivalent to maximizing |F N Fy|. We con- 
clude that the given weight map indeed computes the reconstruction that 
corresponds to the blueprint in as many pixels as possible. 

Figure 9.3(a) shows a blueprint image that represents a semiconductor 
part. The white pixels correspond to the wiring; the black pixels correspond 
to the background. Suppose that the object shown in Fig. 9.3(b) passes the 
scanner. The object clearly contains a gap that is not present in the blueprint 
and should be detected. Figure 9.3(c) shows a reconstruction computed from 
the horizontal and vertical projection data of the faulty object, using the 
blueprint image of Fig. 9.3(a). It has the same projections as the image in Fig. 
9.3(b) and corresponds to the blueprint in as many pixels as possible. Although 
the reconstruction is not perfect, the gap is clearly visible and the object 
can be easily identified as faulty. For comparison, consider the image in Fig. 
9.3(d), which also has the same projections as the images in Fig. 9.3(b) and 
(c). This time, the reconstruction corresponds to the blueprint in as few pixels 
as possible. Comparing this reconstruction to the original image of the faulty 
part shows how severely underdetermined the reconstruction problem is when 
only two projections are available. Of course, using a blueprint image does 
not guarantee that the reconstruction resembles the scanned object, but it is 
likely that the reconstruction will be much better than if no prior knowledge 
is used at all. 


Example 2. Another practical problem that can be formulated in the frame- 
work of Problem 2 is how to obtain a 0-1 reconstruction from an already 
computed real-valued reconstruction. Computing a 0-1 reconstruction from 
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Fig. 9.3. (a) Blueprint image of a semiconductor part. (b) Test image, containing a 
gap in one of the wires. (c) Reconstruction of the test image from the horizontal and 
vertical projections, using the image from (a) as a model image. (d) Reconstruction 
using an inverted version of the blueprint image as a model image. 


more than two projections is a computationally hard problem, but for com- 
puting a real-valued reconstruction several algorithms are available, such as 
the algebraic reconstruction technique (ART, see Chapter 7 of [17]). These 
algorithms typically require many projections to compute an accurate recon- 
struction. Figure 9.4(a) shows an ART reconstruction of the image in Fig. 
9.3(b) from six projections. If we want the reconstruction to be binary, this 
reconstruction can be “rounded,” such that all pixel values less than 1/2 be- 
come 0 and all pixel values of 1/2 or more become 1. The result is shown 
in Fig. 9.4(b). A different way to obtain a binary reconstruction is to solve 
Problem 2 using the pixel values of the original image as the weight map: the 
higher the gray value of a pixel in the continuous reconstruction, the higher 
the preference for this pixel to be assigned a value of 1 in the binary re- 
construction. In this way the reconstruction will perfectly satisfy two of the 
projections, while “resembling” the continuous reconstruction. Figure 9.4(c) 
and (d) show two such reconstructions. The reconstruction in Fig. 9.4(c) was 
obtained using v® = (1,0), v(O = (0,1). For the second reconstruction, the 
lattice directions v) = (0,1), v® = (1,1) were used. Both reconstructions 
are better than the one in Fig. 9.4(b) at some features, but it is not clear 
how to detect automatically which one is better, or how the two solutions 
can be combined into one superior solution. In Section 9.6, we describe how 
the reconstructions for different pairs of lattice directions can be combined to 
compute a single, more accurate reconstruction (see Fig. 9.9). 


9.4 Reconstruction from Noisy Projections 


The network model from Sections 9.2 and 9.3 is only suitable for comput- 
ing reconstructions from perfect projection data. In simulation experiments, 
it is easy to compute perfect projections of a given image, but data that are 
obtained by physical measurements are usually polluted by noise. As an ex- 
ample of what happens in the network of Section 9.2, when the projection 
data contain errors, consider the possibility that S) 4 S@). In this case, it 
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H) 


Fig. 9.4. (a) ART reconstruction of the image in Fig. 9.3(b) from six projections. 
(b) Rounded ART reconstruction. (c) Solution of Problem 2, using the ART recon- 
struction as the weight map, for lattice directions (1,0) and (0,1). (d) Solution of 
Problem 2 using lattice directions (0,1) and (1, 1). 


is clear that no perfect solution of the reconstruction problem exists. One can 
still compute a maximum flow in the associated graph G. Due to the line arc 
capacity constraints, such a flow will always have size at most min(S(), $()). 
If the measured projection for a line ¢ is lower than the number of points on 
that line in the original object, that line will always contain too few points 
in the reconstruction, regardless of the measured line projections in the other 
direction, because of the capacity constraint on the corresponding line edge 
of £. 

In this section we consider a modification of the associated graph which can 
be used to compute a reconstruction F for which the norm of the residue, i.e., 
the difference between the projections of F and the two prescribed projections 
is minimal. This network does not have the drawbacks that we described above 
of the network from Section 9.2. 

Let F C A. For k = 1,2, the projections p of F have finite domains, so 
we can regard P®) as a vector of |L?| elements. We denote the sum-norm 
of this vector by | pc |,. For a given prescribed projection p“), the norm 


k k 
IP? - 9 = 9^ PPO- (9.18) 
£eLO) 


equals the total summed projection difference over all lines in L). Another 
norm that is often used in tomography is the Euclidean norm |- |2. The sum- 
norm is better suited for incorporation in the network flow approach. We now 
define a generalization of Problem 1 that allows for errors in the prescribed 
projections. 


Problem 3. Let A, v(0, v2, p™, p?) be given as in Problem 1. Let T € No. 
Construct a set F C A with |F| = T such that | pO — p + |p — 2 
is minimal. 


Problem 3 asks for a set F that has a prescribed number of T' elements 
such that F corresponds as well as possible to the two prescribed projections, 
according to the sum-norm. If Problem 1 has a solution, we can find all solu- 
tions by putting T = S() and solving Problem 3. We will show that Problem 3 
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can be solved within the network flow model. For any n-dimensional vector 
p € R”, define 


Ip|* = S max(pi, 0) . (9.19) 
i=l 


To solve Problem 3, we need to make some modifications to the associated 
graph. Before introducing the modified graph, we prove the following lemma. 


Lemma 1. Let F C A, |F| = T. Then, for k = 1,2, 


PP — p®™|, = 2| pf? — p [* 450-7. p 


Proof. Let k € (1,2). By definition, we have 


| P9 — (9, = PP — pO[* + |p — PO. (9.21) 





For each line £ € L(?, we have 
P(P (0) = p9 + max(P(? (0) — p? (£),0) — max(p(? (4) — PC? (£),0) . (9.22) 


Summing this equation over all lines £ € L(9), it follows that 





T= S® + PH — y — pO — p+, (9.23) 
hence, 
PE — p |, = APP — p * - s — T. (9.24) 

















Lemma 1 shows that solving Problem 3 is equivalent to finding a set F 
with |F| = T for which 


IPP — pO) + + [PO — pjt (9.25) 





is minimal, since S, S$), and T are constant. 

We will now describe how the associated graph can be modified for solving 
Problem 3. The network from Section 9.2 forms the basis for the new network. 
From this point on we refer to the line edges of the network from Section 9.2 as 
primary line edges. As before, we denote the sets of all primary line edges for 
directions v and v(? by E, and E», respectively. Let ( € L( be any lattice 
line for direction v, and let e € Ej its corresponding primary line edge. The 
capacity of e imposes a hard upper bound on the number of points on £ in the 
network flow reconstruction. To relax this hard constraint, we add a second 
edge for each lattice line, the excess edge. The excess edges are parallel to their 
corresponding primary line edges and have the same orientation. We denote 
the set of excess edges for directions v? and v(2 by E! and E}, respectively. 
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The resulting graph G" is shown in Fig. 9.5. The capacities of the primary 
line edges remain unchanged. The excess edges have unbounded capacities. 
Effectively, this means that the total flow through a primary line edge and its 
corresponding excess edge — both belonging to a line £ € L“*) — is bounded 
by |AN £|, as all outgoing flow from the line edges must pass through |A n £| 
point edges where each has capacity 1. Therefore, it is still possible to assign 
finite capacities to the excess edges. 

'The primary line edges of the new graph are still assigned a cost of 0, as 
in the original network. The excess edges are assigned a cost of K, where K 
is a positive constant. In this way it is possible to allow more points on a line 
L than p? (£), but only at the expense of a cost increase. 

Now consider the problem of finding a min cost flow in G' of size T. 
Without computing such a flow, we can already be sure that any excess edge 
will only carry flow if its corresponding primary line edge is saturated up to 
its capacity. Otherwise, the cost could be decreased by transferring flow from 
the excess edge to the primary edge. 

Suppose that Y : E — Z is a min cost flow in G” of size T. The total cost 
of Y, given by 

CY)=K(X Y() - M Y(o). (9.26) 
ec Ei ec E, 

Let Fy be the set of points for which the corresponding point edges in Y 
carry a positive flow, as in the Proof of Theorem 1. For any line / € L(9, 
the total flow through the primary and excess edges of £ must equal pu (2), 
because of the flow conservation constraints. Therefore, we have 





k 
Y. Ye) = |PL - p" |* ; (9.27) 
ec E, 
hence, 
C(Y) = K(|PẸ — pO |* + PÊ — gi) . (9.28) 


Applying Lemma 1, we conclude that a min cost flow in G” of size T yields a 
solution of Problem 3. 

The new network can also be used to solve an extended version of Prob- 
lem 2. 


Problem 4. Let A, v (0, v(2, p®, p(2 be as given in Problem 2. Let T € No, 
a € Ryo. Construct a set F C A with |F| = T such that 


all Pf? — pO]; + [PP — 521) — V^ Wia) (9.29) 
cer 





is minimal. 
Similar to the procedure for solving Problem 2, we set C(e) = —W(d(e)) 
for all e € Ep. Assuming that an excess edge only carries flow if its corre- 


sponding primary line edge is completely full, the total cost of an integral 
flow Y € y now becomes 
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Fig. 9.5. Modified associated graph G" for the triple (A, vo, Q2) from Fig. 9.1. 


C(Y) = KIPS — pO |* + [PR -pjt — V^ wee). (9.30) 


ce Fy 
Setting K = 2a and using Eq. (9.24) yield 
CY) = of PR) -ph + [PS -»9h)- $ W@)-Co, (931 
xEFy 


where Co is a constant. We conclude that if Y is an integral min cost flow of 
size T in G’, then Fy is a solution to Problem 4. 


9.5 Algorithms and Implementation 


As described in the previous sections, Problem 1, 2, 3, and 4 can all be solved 
as instances of network flow problems. Both the max flow problem and the 
min cost flow problem have been studied extensively. The book [1] provides an 
overview of available algorithms. A survey of the time complexities of various 
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network flow algorithms can be found in [21] (max flow: Chapter 10; min cost 
flow: Chapter 12). 

We now assume that the reconstruction lattice A is a square of size NXN, 
and we fix a pair (v‘)) ,v)) of lattice directions. It is clear that the number of 
points in A on each lattice line parallel to v™ or v2 is O(N). It is also clear 
that the number of lattice lines parallel to v™ or v(2 that have a nonempty 
intersection with A is O(N). 

Problem 1 can be solved as an instance of the max flow problem in the 
associated graph. In [10], Goldberg and Rao describe an algorithm to compute 
a maximum flow in a graph with n nodes, m edges, and maximum edge 
capacity c in O(n?/?m log(n? /m) log c) time. The associated graph of the triple 
(A, v® v) has n = O(N) nodes, m = O(N?) edges, and a maximum edge 
capacity of c = O(N). Therefore, Problem 1 can be solved in O(N*/? log N) 
time. 

Problems 2 and 3 can both be solved as instances of the min cost flow 
problem, i.e., the problem of finding a flow of fixed size that has minimal cost. 
The min cost flow problem can be reformulated as a minimum-cost circulation 
problem by adding an edge from the sink node ¢ to the source node s; see 
Section 12.1 of [21]. In [11], Goldberg and Tarjan describe an algorithm to 
compute a minimum-cost circulation in a graph with n nodes, m edges, and 
maximum (integral) edge cost K in O(nmlog(n? /m)log(nK)) time. For the 
associated graph from Section 9.3, as well as for the modified associated graph 
from Section 9.4, this yields a time complexity of O(N? log(N K)) for solving 
the min cost flow problem. 

'The problem of finding a maximum flow in the associated graph is known 
in the literature as simple b-matching. A flow that saturates all line edges 
is called a perfect simple b-matching, and the weighted variant of finding a 
perfect b-matching is known as perfect weighted b-matching; see Chapter 21 of 
[21]. For these particular network flow problems, special algorithms have been 
developed that are sometimes faster than general network flow algorithms. 

Implementing fast network flow algorithms is a difficult and time-consuming 
task. The fastest way to use such algorithms is to use an existing implementa- 
tion. Several network flow program libraries are available, some commercially 
and some for free. The ILOG CPLEX solver [15] performs very well for a wide 
range of network flow problems. The CS2 library from Goldberg [9] performs 
well and is free for noncommercial use. The same holds for the RelaxIV library 
from Bertsekas [4]. 


9.6 Extension to More Than Two Projections 
As shown in the previous sections, the reconstruction problem from two pro- 


jections is well understood and can be solved efficiently. We now move to the 
case where more than two projections are available. 
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Problem 5. Let n > 2 and let v), ..., v? be given distinct lattice direc- 
tions. Let A C Z? be a given lattice set. For k = 1,...,n, let p(9 : L9 — No 
be given functions. Construct a set F C A such that p? = p) for 
k-l.n 


When more projections are available, the reconstruction problem is less un- 
derdetermined and we would like to be able to use the additional projections 
to increase the reconstruction quality. However, the reconstruction problem 
for more than two projections is NP-hard. Therefore, we have to resort to ap- 
proximation algorithms. In this section we will describe an iterative algorithm 
that uses only two projections in each iteration. Within an iteration, a new 
pair of projections is first selected. Subsequently, an instance of Problem 2 is 
solved to obtain a reconstruction that satisfies the current two projections. 
'The reconstruction from the previous iteration, which was computed using 
a. different pair of projections, is used to construct the weight map of Prob- 
lem 2 in such a way that the new reconstruction will resemble the previous 
one. In this way the other projections are incorporated in the reconstruction 
procedure in an implicit way. 


Compute the start solution F°; 
i := 0; 
while (stop criterion is not met) do 
begin 
i:= i+ l1; 
Select a new pair of directions va and v (1< a « b X n); 


Compute a new weight map W* from the previous solution F*-!; 


Compute a new solution F^ by solving Problem 2 for 
directions v, and v», using the weight map W’; 


end 





Fig. 9.6. Basic steps of the algorithm. 


Figure 9.6 describes the basic structure of the algorithm. In the next sub- 
sections each of the steps will be described. The algorithm relies heavily on 
the methods for solving two-projection subproblems, which we described in 
the previous sections. 


9.6.1 Computing the Start Solution 


At the start of the algorithm, there is no *previous reconstruction"; a start 
solution has to be computed for the iterative algorithm. Ideally, the start 
solution should satisfy two criteria: 
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(a) Accuracy. The start solution should correspond well to the prescribed 
projection data. 

(b) Speed. The start solution should be computed fast (compared with the 
running time of the rest of the algorithm). 


These are conflicting goals. Computing a highly accurate binary recon- 
struction will certainly take too much time, as the reconstruction problem is 
NP-hard. 

There are several options for computing the start solutions, each having 
a different tradeoff between speed and accuracy. The first option is to choose 
the empty set F° = Ø as a start solution, i.e., an image that is completely 
black. 

A better alternative is to use a very fast approximate reconstruction al- 
gorithm, such as one of the greedy algorithms described in [12]. The running 
time of these algorithms is comparable to the time it takes to solve a single 
network flow problem in the body of the main loop of our algorithm. 

A third possibility is to start with a continuous reconstruction. A binary 
start solution can then be computed from the continuous reconstruction, as 
described in Example 2 of Section 9.3. One class of reconstruction algorithms 
that can be used consists of the algebraic reconstruction algorithms (see Chap- 
ter 7 of [17]). The basic idea of these algorithms is to describe Problem 5 as 
a system of linear equations: 


Mz -—b. (9.32) 

X1 
111000000, |” bi 
000111000 i) ba 
000000111 a NM 
100100100 2 EE 
010010010 E bs 
001001001 T bo 

X8 

X9 





(b) 


Fig. 9.7. (a) Numbering scheme for the lattice points and the lattice lines in a 
rectangular reconstruction lattice. (b) System of equations corresponding to the 
numbering in (a). 


Figure 9.7 shows an example 3x3 grid with the corresponding system of 
equations for two directions, v? = (1,0) and v? = (0,1). Each entry of the 
vector x represents an element of A. The entries of the vector b correspond 
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to the line projections for lattice directions v™,...,v(. Each row of the 
binary matrix M represents a lattice line. The entry Mi; is 1 if, and only if, 
its corresponding lattice line 4 passes through point 7. 

The system (9.32) is usually underdetermined. The shortest solution of 
the system with respect to the Euclidean norm |- |2, which we denote as 
x", is a good choice for a start solution in discrete tomography. It can be 
shown that if Problem 5 has several solutions, then the Euclidean distance 
of z* to any of these solutions is the same, so z* is “centered” between the 
solutions. In addition, if the system (9.32) has binary solutions, any of these 
solutions has minimal norm among all integral solutions. Therefore, a short 
solution is likely to be a good start solution. We refer to [13] for the details of 
these arguments. The shortest solution of (9.32) can be computed efficiently by 
iterative methods, as described in [23]. After this solution has been computed, 
a pair (v(?, v?) of lattice directions has to be selected for computing the 
binary start solution. The start solution is computed by solving Problem 2, 
using the pixel values in z* as the weight map. 


9.6.2 Computing the Weight Map 


In each iteration of the main loop an instance of Problem 2 is solved. The 
weight map for this reconstruction problem is computed from the reconstruc- 
tion of the previous iteration; it does not depend on the selected pair of lattice 
directions. 

The weight map should be chosen in such a way that the new recon- 
struction resembles the reconstruction from the previous iteration. In the new 
instance of Problem 2, only two of the projections are used. If the new recon- 
struction is similar to the previous reconstruction, which was computed using 
a different pair of projections, the new image will also approximately adhere 
to the prescribed two projections from the previous iteration. Repeating this 
intuitive argument, we would hope that the new image also satisfies the pro- 
jections from the iteration before the previous one, from the iteration before 
that one, etc. 

'The most straightforward way to make the new reconstruction resemble 
the previous one is to follow the approach from Example 1 in Section 9.3. If 
we put 
1 if(rz,y)eF', 


: (9.33) 
0 otherwise, 


W'((z,y)) = | 
the new reconstructed image F will have the same pixel value as F*^! in 
as many pixels as possible. Unfortunately, this choice usually does not lead 
to good results. Typically, the main loop of the algorithm does not con- 
verge, making it difficult to decide when the algorithm should be terminated. 
'This behavior is by no means surprising. The reconstruction problem from 
a small number of projections is severely underdetermined. If no additional 
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prior knowledge is used, a small number of projections (e.g., four or five) may 
not even be nearly enough data to uniquely determine a reconstruction. 

To deal with this problem, we focus on the reconstruction of images that 
satisfy additional properties. Smoothness is a property that can often be ob- 
served in practical images: Images consist of large areas that are completely 
black or completely white, instead of exhibiting completely random pixel pat- 
terns. A nice property of the smoothness concept is that it can be measured 
locally. We say that an image F is perfectly smooth at pixel x € A if all neigh- 
boring points of x have the same value as x. Of course, this notion requires 
the definition of a neighborhood of x, which we will describe below. 

From this point on, we assume that the reconstruction lattice A is rectan- 
gular. If this assumption is not satisfied, we can use any square reconstruction 
lattice A’ for which A C A’, as increasing the size of the reconstruction lattice 
does not affect the projections. 

Let F'-! be the reconstructed image from the previous iteration. As a 
neighborhood of the point p = (£p, yp) € A, we choose a square centered in 
(Zp, yp). The reason for preferring a square neighborhood over alternatives is 
that the required computations can be performed very efficiently in this case. 
Let p = (£p, yp) € A. Let r be a positive integer, the neighborhood radius. Put 


Np ={(2,y)€ Al ep—r7r <2 < apt, y—r<y<ytr}. (9.34) 


Np contains all pixels in the neighborhood of p, including p. In case p is 
near the boundary of A, the set N, may contain fewer than (2r + 1)? pixels. 
Let sp be the number of pixels q € Np for which F'(p) = F(q). Define 





fo = No : (9.35) 


We call fp the similarity fraction of p. A high similarity fraction corresponds 
to a smooth neighborhood of p. 

Let g : [0,1] —^ Rso be a nondecreasing function, the local weight func- 
tion. This function determines the preference for locally smooth regions. We 
compute the pixel weight W (p) of p as follows: 


Wp) = 3F(») - 509) - (9.36) 


Note that 2(F(p) — 5) is either —1 or +1. 

When we take g(f) — 1 for all f € [0, 1], there is no preference for local 
smoothness. To express the preference, we make the local weight function g an 
increasing function of f,. Now a pixel having a value of 1 that is surrounded by 
other pixels having the same value will obtain a higher weight than such a pixel 
that is surrounded by 0-valued pixels. A higher weight expresses a preference 
to retain the value of 1 in the next reconstruction. The same reasoning holds 
for pixels having a value of 0, except that in this case the pixel weights are 
negative. Three possible choices for the local weight function are 
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(a) 9(fp) = fo, 
(b) 9(fp) = V fps 
(c) 9(fp) = fp - 

The last choice results in a strong preference for pixels that are (almost) 
perfectly smooth. Of course, many other local weight functions are possible. 
In [3], extensive results are reported for the local weight function 


fp) - 4Af (0.65 < fp <1), (9.37) 
9 (fp =1). 


The choice for this particular function is somewhat arbitrary. In each case, 
a preference is expressed for retaining the pixel value of p in the next recon- 
struction, instead of changing it. In the case that the whole neighborhood of 
p has the same value as p, this preference is very strong. If the neighborhood 
contains a few pixels having a different value, the preference is less. If there are 
many pixels in the neighborhood that have a different value, the preference is 
even smaller. 

So far we have not discussed how the neighborhood radius should be cho- 
sen. If the start solution is already a good approximation of the final recon- 
struction, using a fixed value of r = 1 works well. For this neighborhood 
radius, the differences between consecutive reconstructions F* are typically 
small. It is usually better to start with a larger neighborhood radius, e.g., 
r= 5 or r = 8. This typically results in large changes between consecutive 
reconstructions. Only very large regions of pixels that have the same value 
obtain a strong preference to keep this value. Regions that are less smooth 
can easily change. A choice that works well for the range of images studied in 
[3] is to start the algorithm with r — 8 and to set r — 1 after 50 iterations. 


9.6.3 Choosing the Pair of Directions 


In each iteration of the main loop of the algorithm, a new pair of lattice 
directions is selected. There is no selection scheme that is “obviously best" 
in all cases. Yet there are several ways for choosing the direction pairs that 
perform well in practice. 

A good choice for the new direction pair is to choose the lattice directions 
v™, v), for which the total projection error 


[PO — p |, + [PO — pO), (9.38) 





is largest. After solving the new instance of Problem 2, the total projection 
error for these two lattice directions will be zero, assuming perfect projection 
data. This also guarantees that if at least two projections have a positive 
projection error after the previous iteration, both new lattice directions will 
be different from the ones used in the previous iteration. 
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If the number of projections is very small (e.g., four or five) the projection 
error is not a good criterion for selecting the new projection pair. For the case 
of four projections, this scheme leads to cycling behavior between two pair 
of projections. The other projection pairs are not used at all. To avoid this 
behavior, it is better to use a fixed order of direction pairs, in which all pairs 
occur equally often. Such schemes, for four and five projections, are shown in 
Table 9.1. 


Table 9.1. (left) Lattice Direction Scheme for Four Projections (Each projection 
pair is used equally often. No projection pair is used in two consecutive iterations.) 
(right) Lattice Direction Scheme for Five Projections 


Iteration 123456 Iteration 12345678910 
Ist dr. 131212 Ist dir. 1352412345 
2nd dir. 243443 2nd dir. 2413534512 


9.6.4 Stop Criterion 


In general, it is not easy to determine when the iterative algorithm from 
Fig. 9.6 should terminate, because there is no guaranteed convergence. Yet, 
the experiments from [3] show that if enough projections are available, the 
algorithm often converges to the exact solution of Problem 5. Detecting that 
an exact solution has been found is easy, and the algorithm always terminates 
in that case. 

To measure the quality of the current reconstruction F’, the total projection 
difference 


D(F) = Y [PP — 99 (9.39) 
k=1 


can be used. This distance is 0 for any perfect solution of Problem 5 and 
greater than 0 otherwise. The total projection difference can be used for 
defining termination conditions. If no new minimal value is found for the 
total projection distance during the last T iterations, where T is a positive 
integer, the algorithm terminates. We used T = 100 for all experiments in the 
next subsection. 


9.6.5 Some Results 


We will now show some experimental results obtained by the iterative al- 
gorithm from Fig. 9.6. The performance of the algorithm, as for any other 
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general discrete tomography algorithm, depends heavily on the type of image 
that is being reconstructed and the number of available projections. In order 
to give extensive statistical results about the reconstruction quality, a class 
of images has to be defined first. All images in this class should have simi- 
lar characteristics. The performance of the algorithm can then be measured 
for this particular image class. In [3], reconstruction results were reported 
for several different image classes. The results in this section show a varied 
set of test images with their reconstructions, rather than providing extensive 
quantitative results. Figure 9.8 shows six test images, each having different 
characteristics, and their reconstructions. The number n of projections that 
was used is shown in the figure captions. The reconstructions of the first five 
images (a-e) are all perfect reconstructions, obtained using the weight func- 
tion in Eq.(9.37) from Section 9.6.2. For the first four images (a-d), the linear 
local weight function also works very well, even faster than the function in 
Eq.(9.37). The image in Fig. 9.8(e) contains many fine lines of only a single 
pixel's thickness. In this case, the local weight function g(fp) = \/fp works 
well. Which local weight function is best for a certain class of image depends 
strongly on characteristics of the particular class. This is also true for the 
number of projections required to reconstruct an image. 

For reconstructing the image in Fig. 9.8(a), four projections suffice. The 
structure of the object boundary is fairly complex, but the object contains no 
“holes.” The iterative algorithm reconstructed the image perfectly from four 
projections (horizontal, vertical, diagonal, and antidiagonal). 

Figure 9.8(b) shows a much more complicated example. The object con- 
tains many cavities of various sizes and has a very complex boundary. Some 
of the black holes inside the white region are only a single pixel in size. In 
this case, our algorithm requires six projections to compute an accurate re- 
construction. Some of the fine details in this image are not smooth at all. Still, 
the fine details are reconstructed with great accuracy. The image from Fig. 
9.8(c) is even more complex. It requires eight projections to be reconstructed 
perfectly. 

The image in Fig. 9.8(d) has a lot of local smoothness in the black areas, 
but it contains no large white areas. Still, the image is smooth enough to be re- 
constructed perfectly from only five projections. T'his example also illustrates 
that very fine, nonsmooth details can be reconstructed by the algorithm, as 
long as the entire image is sufficiently smooth. 

Figure 9.8(e) shows a vascular system containing several very thin vessels. 
The iterative algorithm can reconstruct the original image perfectly from 12 
projections. This is quite surprising, since the very thin vessels have a width 
of only one pixel, so they are not smooth. Still, the smoothness of the thicker 
vessels and the background area provides the algorithm with enough guidance 
to reconstruct the original image. 

When the image contains no structure at all, the algorithm performs very 
badly. Figure 9.8(f) shows an image of random noise. The reconstruction from 
12 projections shows that our algorithm has a preference for connected areas 
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of white and black pixels. In this case, however, the smoothness assumption 
is obviously not satisfied by the original image. The distance between the 
projections of the image found by our algorithm and the prescribed projections 
is very small, however. 





(a) Original image. Reconstr., n = 4. (b) Original image. Reconstr., n = 6. 
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Reconstr., n = 8. (d) Original image. Reconstr., n = 5. 
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(e) Original image. Reconstr., n — 12. (f) Original image. Reconstr., n — 12. 
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Fig. 9.8. Six original images and their reconstructions. The number n of projections 
is shown in the figure caption. [The images in Fig. 9.8(f) show a zoomed portion of 
the center of the actual images, to make the details clearly visible.] 


For reconstructing the images in Fig. 9.8, a sufficiently large set of pro- 
jections was used. Figures 9.9 and 9.10 demonstrate the result of using the 
algorithm if too few projections are available. 

Figure 9.9 shows the results of reconstructing the semiconductor image of 
Fig. 9.3(b) from three and four projections, respectively. When we use only 
three projections, a reconstruction is found that has exactly the prescribed 
projections, but the reconstruction is very different from the original image. 

If we use too few projections, the algorithm may also get stuck in a local 
minimum of the projection distance, which is shown in Fig. 9.10. The original 
image can be reconstructed perfectly from five projections, but when only 
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four projections are used, the algorithm fails to find a good reconstruction. 
The projections of the reconstructed image are significantly different from 
the four prescribed projections, yet the algorithm is unable to find a better 
reconstruction. 





bid 


Original image. Reconstruction, n — 3. Reconstruction, n — 4. 


Fig. 9.9. (a) Original image. (b) Reconstruction from three projections (horizontal, 
vertical, diagonal) that has exactly the prescribed projections. (c) Perfect recon- 
struction of the original image from four projections (horizontal, vertical, diagonal, 
antidiagonal). 
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Original image. Reconstruction, n — 4. Reconstruction, n — 5. 
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Fig. 9.10. (a) Original image. (b) Reconstruction from four projections, which 
does not have the prescribed projections. The horizontal and vertical projections are 
identical to those of the first image. The diagonal and antidiagonal projections have 
a total projection difference (sum of absolute values) of 184 and 126, respectively. 
(c) Perfect reconstruction of the original image from five projections. 


9.7 Reconstructing 3D Volumes 


So far our approach has been concerned with the reconstruction of two-dimen- 
sional images. In many practical applications of tomography, it is important to 
obtain 3D reconstructions. Computing 3D reconstructions is usually a compu- 
tationally very demanding task, as large amounts of data are involved. There 
is a slight difference in terminology between 2D and 3D reconstructions. Pix- 
els in 2D images are usually called vozels in the context of 3D images, where 
they represent a unit cube in the 3D volume. 
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If there exists a plane H in Z? such that all projection directions lie in 
H, all algorithms for 2D reconstruction can be used directly for 3D recon- 
struction as well, reconstructing the volume as a series of slices. All slices can 
be reconstructed in parallel, which allows for a large speedup if several pro- 
cessors are used. A disadvantage of reconstructing all slices independently is 
that certain types of prior knowledge cannot be exploited. For example, if we 
generalize the preference for local smoothness from Section 9.6 to 3D, voxels 
from adjacent slices are required to compute the neighborhood density of a 
certain voxel. Therefore, the reconstruction computations of the slices are no 
longer independent. 

If the projection directions are not coplanar, reconstructing the volume 
as a series of slices is not possible. This situation occurs, for example, in the 
application of atomic resolution electron microscopy. The crystal sample is 
tilted in two directions to obtain as many useful projections as possible. 








Fig. 9.11. 3x 3 x 3 binary volume with its projections in directions v? = (1,0,0) 
and v(? = (0,1, 0). A large circle indicates a value of 1; a small circle indicates a 
value of 0. 


We will now show how the algorithm for 2D reconstruction from the pre- 
vious section can be generalized to the 3D case. Figure 9.11 shows an example 
of a 3 x 3 x 3 volume A with its projections parallel to the lattice directions 
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v® = (1,0,0) and v? = (0,1,0). Lattice points that have a value of 1 (i.e., 
lattice points included in the set F) are indicated by large dots. Similar to 
the associated network from Section 9.2, each two-projection problem in 3D 
also has an associated graph. The associated graph for the volume in Fig. 
9.11 is shown in Fig. 9.12. Just as in the 2D case, the associated graph con- 
tains a line edge for every projected lattice line. The middle layer of edges 
contains one edge for every voxel, connecting the two line nodes for which the 
corresponding lines intersect with that voxel. 
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Fig. 9.12. Network corresponding to the two-projection reconstruction problem 
in Fig. 9.11. 
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Figure 9.12 shows a nice property of the two-projection reconstruction 
problem. For any point edge (ni;,n2,j) € Ep in the associated graph, the 
lines /1,; and £»,; have a nonempty intersection in A, so there is a plane in Z? 
that contains both 41, and £5,;. Since £4, and £5,; are translates of v@) and 
v), respectively, this plane will always be a translate of the plane spanned 
by v) and v(2, If two lines lı i and £5; lie in different translates of this 
plane, there will be no voxel edge connecting the corresponding line nodes. 
'Therefore, the max flow problem can be solved for each translate of the plane 
independently. In the example network of Fig. 9.12, the subproblems for each 
of the planes z = 0, z = 1, and z = 2 can be solved independently. This 
property holds for any pair (v(9), v©)) of lattice directions, although the sizes 
of the subproblems depend on the direction pair. The number of point edges 
in each subproblem is bounded by the maximal number of voxels in A that 
lie in a single plane. 
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(d) 1000 small spheres, 139x 139x139: perfect reconstruction from 6 projections. 


Fig. 9.13. Reconstruction results for four 3D volumes. 
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Figure 9.13 shows four different test volumes, each displayed from three 
different viewing directions. The directions were selected to provide a clear 
view of the volume; they are not parallel to any of the projection directions. 
The iterative network flow algorithm can reconstruct each of these images from 
projections along the six lattice directions (1,0,0), (0, 1,0), (0,0, 1), (1, 1,0), 
(1,0, 1), and (0,1, 1). The image dimensions are shown in the figure. For the 
test image in Fig. 9.13(a), a perfect reconstruction is already found if only the 
first four projections are used. The test volumes are surrounded by a black 
background, which is not counted in the image dimensions. For all four test 
volumes, the algorithm computed the 3D reconstruction within 7 minutes on 
a standard 2.4GHz Pentium IV PC. 


9.8 Extension to Plane Sets 


So far we have considered the reconstruction of lattice sets in 2D and 3D. This 
model is well suited for the application of nanocrystal reconstruction at atomic 
resolution in electron microscopy [16]: Atoms in a crystalline solid are arranged 
regularly in a lattice structure. In many other applications of tomography 
there is no "intrinsic lattice." In this section, we consider the reconstruction 
of subsets of IR? from its projections in a small number of directions. We 
will also refer to such subsets as planar black-and-white images. In this new 
context, the projection along a line is no longer a sum over a discrete set; 
rather it is a line integral or strip integral of a function IR? — {0,1}, which 
yields a real value. 

Binary tomography problems without an intrinsic lattice structure occur 
often in practice, for example, in medical imaging [14]. Besides using a pixel 
representation for the reconstructed image, other representations have also 
been proposed. If the object of interests can be approximated well by a poly- 
gon, for example, one can use a polygonal, representation in the reconstruction 
algorithm (see [18]). 

'The iterative network flow algorithm can be adapted in such a way that it 
can be used for the reconstruction of planar black-and-white images. We will 
only give a high-level overview of the algorithm. The details will be described 
in a future publication. 

Figure 9.14 shows an example of a planar black-and-white image along 
with two of its projections. If strip projections are used, the total amount 
of “white” (or black) in a set of consecutive strips parallel to the projection 
direction is measured. As it is impossible to represent all planar images in a 
computer program, they are approximated by representing the images on a 
pixel grid. Each of the pixels can have a value of either 1 (white) or 0 (black). 

The weighted reconstruction problem for two projections (i.e., Problem 
2 in the context of lattice sets) can also be solved efficiently in the case of 
planar subsets. However, a specifically chosen pixel grid must be used, which 
depends on the two projection directions. Figure 9.15 shows how the grid is 
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Every pixel on the grid is the intersection of a 
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strip in the first direction with a strip in the second direction. The network 


flow approach can be used for this pixel grid, to compute a 0-1 reconstruction 


projections are used, the total amount of “black” in a set of consecutive strips 
from the given two projections. 


Fig. 9.14. A planar black-and-white image with one of its projections. If strip 
parallel to the projection direction is measured. 


formed from two projections. 


Fig. 9.15. Two parallel beams span a pixel grid. On this pixel grid, network flow 


methods can be used to solve the two-projection 0-1 reconstruction problem. 
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In every iteration of the iterative network flow algorithm, a new pair of 
projections is selected. Therefore, the pixel grid on which the new solution is 
computed is different in each iteration. To compute the weight map for the new 
pixel grid, the image from the previous iteration (defined on a different pixel 
grid) is first converted to a grayscale image on the pixel grid by interpolation. 
Recall that the computation of the similarity fraction for a pixel p does not 
require that neighboring pixels of p are binary. Therefore, the new weight 
map can be computed in a straightforward way. An overview of the adapted 
version of the iterative algorithm is shown in Fig. 9.16. 


Compute the start solution F? on the standard pixel grid; 
$:— 0; 

while (stop criterion is not met) do 

begin 


i:=i +1; 
Select a new pair of directions v? and v (1 <a < b € n); 


Convert the previous solution F^^! to an image F*- that is defined 


on the pixel grid spanned by directions v(9 and vO; 


Compute a new weight map W? from the image F*~!; 


Compute a new solution F* on the grid spanned by v and v® 
by solving a variant of Problem 4, using the weight map W’. 


end 





Fig. 9.16. Basic steps of the algorithm for plane sets. 


Just as for lattice images, the iterative network flow algorithm for plane 
sets is capable of computing very accurate reconstructions if a sufficient num- 
ber of projections is available. However, if the original image does not have 
an intrinsic lattice structure, we cannot expect a reconstruction that perfectly 
matches the projection data, as it is unlikely that the original image can be 
represented as a binary image on the pixel grid used by the algorithm. 
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A Convex Programming Algorithm for Noisy 
Discrete Tomography 


T.D. Capricelli and P.L. Combettes 


Summary. A convex programming approach to discrete tomographic image recon- 
struction in noisy environments is proposed. Conventional constraints are mixed with 
noise-based constraints on the sinogram and a binariness-promoting total variation 
constraint. The noise-based constraints are modeled as confidence regions that are 
constructed under a Poisson noise assumption. A convex objective is then minimized 
over the resulting feasibility set via a parallel block-iterative method. Applications 
to binary tomographic reconstruction are demonstrated. 


10.1 Introduction 


The tomographic reconstruction problem is to estimate a multidimensional 
signal z in a Hilbert space H from lower-dimensional measurements of its 
line integrals. In computerized tomography, the signals are discretized over 
a bounded domain and H is the usual Euclidean space R (using standard 
lexicographic ordering, an N-voxel signal is represented as an N-dimensional 
vector). A popular approach for solving this problem is to pose it as a convex 
feasibility problem of the form 


m 
find FE()S;, (10.1) 
i=1 
where (Si)i<i<m are closed convex sets in IR^ arising from prior knowledge 
(bounds, support information, spectral information, information about the 
noise corrupting the measurements, etc.) and the discrete line integral (line 
sum) measurements [4, 6, 21, 31]. This set theoretic approach to tomographic 
image reconstruction goes back to [18]; further developments can be found in 
(24, 25, 30, 32, 35]. In some instances it may be justified on physical grounds 
to seek a feasible image that is optimal in some sense. The problem then 
assumes the form 


find TES — ("| S; such that Y(T) = inf e(S), (10.2) 


i=l 
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where o: RN — (—oo, +00] is a convex function. The advantage of this convex 
programming formulation is that it allows the incorporation of a wide range 
of prior information in the reconstruction process, and, at the same time, it 
benefits from the availability of powerful algorithms; see [1, 4, 7, 8, 9] and the 
references therein. 

In discrete tomography, the range of the signal Y to be reconstructed is 
known to be a finite set, for instance, the set {0,1} in binary tomography. 
'This additional information is of paramount importance and has profound 
consequences on the theoretical and practical aspects of the reconstruction 
problem [22]. As underlined in [22], classical computer tomography algorithms 
do not perform well in the presence of few (say 10 or less) views (we do not 
employ the standard term “projection,” as it will be reserved to describe the 
best metric approximation from a convex set). Consequently, they are not 
directly applicable in discrete tomography, where such low numbers of views 
are common. Furthermore, classical algorithms do not exploit nor enforce the 
discrete nature of the original signal. 

In this chapter, we propose a convex programming approach to the discrete 
tomography problem in noisy environments. In recent years, many papers have 
been devoted to the theoretical and numerical investigations of discrete recon- 
struction problems; see [15, 17, 19, 22, 23, 29, 34] and the references therein. 
The novelty of our work is to propose a convex programming formulation 
of this problem that explicitly takes into account the presence of noise in the 
measured data, and to provide a numerical method to solve it. Our formulation 
is of the form (10.2), and our algorithm is based on the block-iterative meth- 
ods recently developed in [8, 9]. While our approach is applicable to general 
discrete problems, we shall focus on the case of binary images for simplicity. 
Since the set of binary images is nonconvex, our first task will be to find per- 
tinent convex constraints that will promote the binary nature of the image: 
Total variation will be used for this purpose. Thus, the image produced by 
the algorithm will be relatively close to being binary, which will minimize the 
number of errors incurred by the final binarization step. Other constraints will 
exploit standard information (bounds, support) as well as information about 
the data model and the noise. 

'The remainder of the chapter is organized as follows. In Section 10.2, we 
review the parallel block-iterative algorithm that will be employed to solve 
the convex program (10.2). In Section 10.3, we address the construction of 
constraints for noisy binary tomography. The new constraints are confidence 
regions that are based on statistical attributes of the noise perturbing the sino- 
gram, and a total variation constraint aimed at promoting the binary nature 
of the reconstructed image. A Poisson noise model is assumed in our statis- 
tical analysis of the confidence regions. In Section 10.4, we describe several 
applications of this convex programming framework to binary image recon- 
struction in the presence of noisy measurements. A few remarks conclude the 
chapter in Section 10.5. 
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10.2 Proposed Algorithm 


Throughout the chapter, the signal space containing the original image T is 
the standard Euclidean space RY, with scalar product (- | -), norm ||- ||, and 
distance d. The distance to a nonempty set C C RY is do(x) = inf ||x — C ||. 
If C c RP is nonempty, closed, and convex, then, for every x € RY, there 
is a unique point Pox € C such that ||x — Poz|| = dc(x); Pez is called the 
projection of x onto C. 

To solve (10.2), we use the parallel block-iterative algorithm described 
in [9], where the framework of [8] was adapted to problems with quadratic 
objective functions. Although more general strictly convex objectives y can 
be used [8], we restrict our attention to quadratic functions in this chapter, 
as they lead to particularly simple implementations. 

Let 





yg: RN >R: zo (R(x—r) | r), (10.3) 


where r € RN and R € R* is a positive definite symmetric matrix. In addi- 
tion, we suppose (without loss of generality) that the closed convex constraint 
sets (S;)i<i<m in (10.2) assume the form 


(Vi € {1,...,m}) S= {x ER” | fi(z) < ài), (10.4) 


where (fi)1<i<m are convex functions from R to R and (d;)1<i<m are real 
numbers such that S = (Y^, S; # Ø. Recall that, under these assumptions, 
for every z € RY, each f; admits at least one subgradient at x, i.e., a point 
gi € R” such that [27] 


(vy RY) (y- x |g) + fila) < fiy) (10.5) 


For instance, if C is a nonempty closed convex subset of RY and z € RN « C, 
then 
Bdgtej S to copo dela). (10.6) 


'The set of all subgradients of f; at x is the subdifferential of f; at x and 
is denoted by Of;(r); if f; is differentiable at x, then Of;(r) = (V fi(x)). 
Moreover, the subgradient projection G;z of x onto S; is obtained by selecting 
an arbitrary gi € Of;(x) and setting [7] 
di — fix) 
Ges z+ TAE gi if fi(x) > ói, 


(10.7) 


The algorithm proposed in [9] to solve (10.2)-(10.3) constructs a sequence 
(£n)nen of approximate solutions as follows: 


1. Fix e € (0, 1/m). Set zo =r and n = 0. 
2. Take a nonempty index set L, C [1,...,m]. 


210 T.D. Capricelli and P.L. Combettes 


3. Set zn = £n — A,R-1u,, where: 
(a) for every i € In, Din = Gin} 
(b) the weights (win)ier, lie in [e, 1] and » jer, Win = 1; 
(c) Un = Xn — Dién Wi nPi,n; 
(d) An € [eLn, Ln], where 


3 vinlin = znali? 


i€ In : E 
a. ee s PURUS 
1/]R-1|| otherwise. 


4. Set tm, = (R(xo — £n) | tn — Zu), Un = (R(xo — £n) | Xo — En), va = 
(R(an — Zn) | n — Zn), and ps = UnVn — 72. 
5. Set 


Zi ifo, =0, Tn 20, 


T 
so (14 ZE) len = an) if Pn > 0, aV = pn, 


Intl = Un 


V, f 
En + — (Ts (2o — En) + Hn(zn — £n)) if Pn > 0, Tav < pn. 


Pn 
6. Set n 2 n + 1 and go to step 2. 
The following convergence result is an application of [9, Theorem 16]. 


Theorem 1. Suppose that there exists a strictly positive integer J such that 


n4j-J—1 
(vneN) U m-tL...m). (10.8) 
k=n 


Then every sequence (£n)nen generated by the above algorithm converges to 
the unique solution to (10.2)-(10.3). 


Some comments about this result and the algorithm are in order. 


(a) Condition (10.8) is satisfied in particular when I, = (1,...,m], i.e., when 
all the sets are activated at each iteration. In general, (10.8) allows for vari- 
able blocks of sets to be used, which provides great flexibility in terms of 
parallel implementation (see [7] for examples). More details on the im- 
portance of block-processing for task scheduling on parallel architectures 
can be found in [4]. Further flexibility is provided by the fact that the 
relaxations and the weights can vary at each iteration. 

(b) The algorithm activates the constraints by means of subgradient projec- 
tions rather than exact projections. The former are significantly easier 
to implement than the latter, as they require only the computation of 
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subgradients (gradients in the differentiable case). Analytically complex 
constraints can therefore be incorporated in the recovery algorithm and 
processed at low cost. 

(c) The parameter Ln is always at least equal to 1/||R~"|| [9, Proposition 12], 
and it can attain large values. Choosing A; large (e.g., equal to Ln) usually 
yields faster convergence. 


In [9], it was shown that in order to reduce the computational load of the 
method, an iteration of the algorithm could be implemented as follows (only 
one application of the matrices R and R^! is required): 

1. For every i € In, set a; = — fi(z)9i/||gi||?, where gi € Of;(xn), if fí(x4) > 
ĉi; a; = 0 otherwise. 
2. Choose weights (wi);er, in [e, 1] adding up to 1. Set v = » /;c;, wia; and 

L= Yer willall?. 

3. If L = 0, set z441 = Ln, and exit iteration. Otherwise, set b = £o — £n, 

c= Rb, d= R^ lv, and L = L/ (d | v). 

4. Choose A € [EL, L] and set d = Ad. 
5. Set m = — (c | d), p= (b |c), v = à (d |v), and p= uv — 7°. 
In +d if p=0 and 7 > 0, 
6. Set tna. = 4 tot(l+n/v)d if p» 0 and mv 2 p, 
Ln + Z (xb + ud) if p 7» 0 and mv < p. 
p 





Remark 1. If the projector P; onto S; is easy to implement, one can set f; = 
dg, since S; can certainly be described as the level set 


S; = {x € R | ds,(z) < 0}. (10.9) 
In this case, it follows at once from (10.6) and (10.7) that G; = P; (the 


subgradient projector reduces to the usual projector) and, moreover, that 
ai = Pi£n — £n at step 1 of the algorithm. 


10.3 Construction of Closed Convex Constraint Sets 


10.3.1 Data Model 


The sinogram is the image under the Radon transform of the original image T. 
'The portion of the sinogram corresponding to a given observation angle 0 will 
be referred to as a view. The observed data consist of q noisy views (zi)1«i«q 
at angles (0;)1«i«4. For every i € {1,...,q}, we let L; be the restriction of the 
Radon transform for a fixed angle 0;. In other words, the ith measurement is 


where w; is the noise vector corrupting the observation. Each view z; is a 
one-dimensional signal of M points and will be represented by a vector z; = 
[Gi] T2, p in RM; Li is therefore a matrix in RMX”. Finally, we denote by 
1 the vector [1,...,1]7 in RN or RM, 
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10.3.2 Standard Constraints 


Further details on standard constraint sets can be found in [6, 31]. 


Range 


The first standard constraint arises from the fact that pixel values are nonneg- 
ative and have known maximal value. After normalization, the corresponding 
set is 

lot; (10.11) 
In the context of binary tomography, this is simply the convex hull of the set 
{0,1}" 
Support 


A second common piece of a priori information in tomography is the knowledge 
of the support K of the body under investigation. The associated set is 


{x ER | alk = x), (10.12) 


where 1x is the characteristic function of K and where the product x1, is 
taken componentwise. The projector onto this set is x > a1x [6]. 


Pixel Sum 
Let u be the sum of the pixel values in the original image, i.e., 
y= ( |1). (10.13) 
The knowledge of u leads to the set 
{z ER” | (z |1) =p}. (10.14) 
Since u is never known exactly, this set should be relaxed into the hyperslab 
{x ERN | pw x (z|1) € pt}, (10.15) 


where [1^ , u^] is a confidence interval. The projector onto this set is [6] 





e 1 
ep Ply gena, 
+ 1 
T> ir uar if (x |1) > pt, (10.16) 
N 
x otherwise. 


The values of u^ and u* depend on prior information about the experimental 
setup and about the noise. An example is provided in Subsection 10.3.5. 
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10.3.3 Binariness-Promoting Constraint 


The constrained image recovery method presented in Section 10.2 is limited to 
problems with convex constraints. As a result, since the set of binary images 
is nonconvex, the binariness constraint cannot be enforced directly in such 
a framework. Furthermore, the convex set (10.11) will not properly enforce 
binariness, and we must find a more effective means to promote binariness 
through a convex constraint. 

In recent years, total variation has emerged as an effective tool to recover 
piecewise-smooth images in variational methods. This approach was initiated 
in [28] in the context of denoising problems and has since been used in var- 
ious image recovery problems. Recently, it has also been applied in certain 
variational computerized tomography problems [2, 36]. In all of these ap- 
proaches, total variation appears in the objective of a minimization problem. 
Such nondifferentiable problems are not easy to solve, and they offer very 
limited potential in terms of incorporating constraints [5, 11]. 

In [12], it was observed that, in many problems, the total variation tv(Z) 
of the original image, which measures the amount of oscillations, does not 
exceed some known bound 7. This constraint, which is associated with the set 


{x € RP | tv(z) <7}, (10.17) 


appears to be particularly relevant in binary (and more generally in discrete) 
tomography, as it attenuates the oscillating components in the image, thereby 
forcing the creation of flat areas and promoting binariness. An important issue 
is, of course, the availability of the parameter 7 in (10.17). In this respect, 
binary tomography places us on favorable grounds. Indeed, since the total 
variation of a binary image is related to the length of its contours (i.e., the 
sum of the perimeters of the elementary shapes), it can be estimated with good 
accuracy in certain typical problems from prior experiments or by sampling 
databases [12]. 

Numerically, the total variation of a discrete image x = [£;,j] € RVNxVN 
is computed as 








VN-1VN-1 
tv(z) 2 M5 M VlEitig — &P + léji — Gal? 
i=l j=l 
VN-1 VN-1 
+ M ltuy ilt X yrs Emale (018) 
i=1 j=l 


The subgradient projector onto the set (10.17) can be found in [12]. 


10.3.4 Constraints on the Residual Views 


Using an approach developed in [13] and [33], we can form a wide range of 
statistical constraints modeled by closed convex sets for each of the q views. 
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Indeed, (10.10) gives 


Hence, the residual signal z; — L;x associated with an estimate x of z should 
be constrained to be statistically consistent with every known attribute (e.g., 
moment, periodogram, etc.) of the noise. Consistency is usually enforced via 
some statistical confidence bound (see [10] for an analysis of the global confi- 
dence level in terms of confidence levels of each set). We now provide examples 
of such constraint sets. 


Amplitude 


Let us denote by (ex)i«x« the canonical basis of IRM (recall that M is the 
length of each view z;). If no noise were present, (10.10) would confine % to 
the intersection of the Mq hyperplanes 


(x € RF | (Liz | ex) = (zi | ex) )- (10.20) 


These sets were used in the early ART reconstruction technique [18]. In the 
presence of noise, the hyperplanes must be replaced by hyperslabs of the form 
[20] 

{ax € RN | | (Lix —À | ek) | < ain}. (10.21) 


The confidence interval [—a;,4, a;,,] can be determined from the distribution 
of the random variable (w; | ex). Such distributions can be available in certain 
problems, e.g., [6, 14, 33]. However, in the present problem, the noise is best 
modeled by a signal-dependent process that makes it impossible to obtain 
reliable bounds. We shall therefore not use these sets in our experiments. 


£? Norms 


Let p € [1,+00) and i € {1,...,q}. Suppose that an estimate d of the £P? 
norm of the noise vector w; is available from physical considerations or past 
experience with tomographic reconstructions of similar objects. Then one can 
construct the set [13] 


{x ERY ||Liz — zi]? < dip} . (10.22) 


The projector onto this set has no simple closed form, but the subgradient 
projector can be obtained as follows. Let us denote by (Li k)ı<k<m the M 
rows of L; and by Lj; an entry of the matrix Lj, i.e., 


Loc Pao eese (10.23) 
IEIEN 


Then, by elementary subdifferential calculus [27], 
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M 
O||Lix — zll = Y O| (Lik |2) — GP. (10.24) 


k=1 


Hence, if p > 1, the unique subgradient of x > ||Liax — z;||5 is 


M 
9=P (x Lia] (Lie | x) — Gal" sign((Lin | 2) — o) 
1<1<N 


k=1 
(10.25) 
Now suppose that p = 1. Recall that 
—1 if£«0, 
sign: Em 40 if € — 0, (10.26) 
41 if€>0, 
is a selection of the subdifferential of £ — |£|, which is given by 
[-1)  if£«0, 
(YEER) oj — 4[-1,1] if£-0, (10.27) 
{+1} if€>0. 
Therefore, a subgradient of x + || Lix — zi||1 is 
M 
g= (> Lina sign((Lin | x) — ca] l (10.28) 
k=1 1<I<N 


One can compute the subgradient projection (10.7) via (10.25) and (10.28). 


Energy 
The case p = 2 corresponds to the set 
{x € RN | |Liz — zil? < 6:2}. (10.29) 


In this case, (10.25) reduces to g = 2LT (Liz — 2;). As discussed in [7, 33], the 
projection of an image x onto this set requires an iterative procedure, while 
the subgradient projector is given explicitly by (10.7) as 


6,2 = || Lia = zl? 


+ pet Ste RU ae es LEN 
ze 4^. 2ILT(La — all 


LI(Liz—z) if |Liz— zill? > 6:2, 
Liem) if Mond T e 49 99 
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10.3.5 Bound Estimation in the Case of Poisson Noise 


In this subsection, we address the problem of computing the parameters j^ 
and pt in (10.15) and ô; 2 in (10.29). 

Noise modeling in computerized tomography is a research topic in its own 
right, and it is beyond the scope of the present chapter to attempt to provide a 
precise model for the various complex underlying physical phenomena. In [3], a 
simple additive Gaussian noise model is considered. Since many data collection 
processes in discrete tomography are counting processes (e.g., counting the 
number of atoms in a structure along a certain direction), we adopt here a 
Poisson noise model. More specifically, we assume that the observation z; in 
(10.10) is a realization of a random vector 


T 


Z= [Zir] repens , 


(10.31) 


the components of which are independent Poisson variables with means 
(Aik)i<k<m. It is also assumed that the random vectors (Zi)ı<i<q are in- 
dependent. Now, let A; be the mean of Z;. Then 


A; = EZ; = [Aijelicne ms = LiT. (10.32) 


Bound on the View Sums 


Our purpose here is to determine the parameters u^ and pt in (10.15). A 
property of the discrete Radon transform is that it preserves pixel sums in 
the sense that 


(Vie (L...,a) w= (F |1) = (LF |1). (10.33) 
Since (Z; | 1) is the sum of M independent Poisson variables, it is also a 


Poisson variable, with mean (A; | 1) = u and variance var (Z; | 1) = p. The 
parameter u can be approximated by the sample mean of the q views, i.e., 


1. 
y--M (a |1). (10.34) 
Tul 
'The associated statistical estimator is 
14 
p Zi |1), 10.35 
P 2 | 1) ( ) 
with mean " 
1 
Er = -— E(Z; |1)= 10.36 
2 2. (Zi |l) =p ( ) 


and variance 
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12 
varl = — Ds var (Zi | 1) = A (10.37) 


We look for a confidence interval of the form 
[ E — &y v varl, EI + fiv var | = | n bı v u/q, w+ iV ufq]. (10.38) 


for some £1 > 0. Upon approximating u by the observed sample mean y, 
the confidence interval becomes | y — bı /^/q, Y + bıy/7/4 ]. Monte Carlo 
experiments show that £i = 2.3 gives 98% of the realizations within this 
interval. Thus, with a confidence level of c1 = 98%, we can use the values 


W = 7-238 /7/q and p” =7+2.3/y/¢ (10.39) 


n (10.15). 


Bound on the Residual Energy 


Let i € {1,...,q}. Our purpose is to determine the parameter 6;,2 in (10.29). 
'The mean square residual error is 


M M 
E|Z; - Liz]? = E|Zi - A]? = M varZis = X dik = |All (10.40) 


To compute E||Z; — L;z||5, we need the moments of Z;; up to order 4. We 
have 
E| Ziel? = My 324 + Aik (10.41) 


and 





E|Zi klt = Ma + 6224 + TAZ p + Aik (10.42) 
The fourth central moment is fiietsioie | 
E|Zi — Ail? = E| Zi klt — 4A; E|Zi s? + 6A2,E| Zi? — AM pEZi + M 
= 3X7, + Auk. (10.43) 
We can now compute the second-order moment of the residual energy as 


E\|Z; — L;z||^ = E[Zi — Ai|j 


M 2 
> IZi. — dial? 








=E 
M 
= El Zik — Aik +2 5 E|Zi; — Aa,j|7E| Zik — Ail? 
kel 1<k<j<M 
M 
= MX, + Ain) 3) uus 
kal 1<k<j<M 


= 2| Ai? + Ala DASS , (10.44) 
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and its variance as 


var|Z; — L|? = EIZi — Ail — E?|| Z; — Al? = 21A + Aalla. (10.45) 





An upper bound on its standard deviation is 


o = y Alli + Milla > Vl Aall? + Alla = v'varllZi — Dil? . (10.46) 


We look for a confidence interval of the form [0, u + 856] for some 85 > 0. 
As seen above, u = ||A;|; can be approximated by the sample mean y of 
(10.34) and, therefore, c can be approximated by 4/24? + y. In turn, the 


confidence interval becomes [0, y + 854/24? + 7]. Monte Carlo experiments 
show that £5 — 0.51 gives 9896 of the realizations within this interval. Thus, 
with a confidence level of cz = 9896, we can use the bound 


612 = y 4-0.51//25? +7 (10.47) 
in (10.29). 


Global Confidence Analysis 


There are q + 1 sets that are confidence regions. In the previous discussion, 
the bounds on the sets (10.15) and (10.29) were determined so as to obtain 
individual confidence levels of cı = c; = 98%. Using the analysis of [10], the 
global confidence level c on the feasibility set S = (7. , 5; satisfies 


c21-—(1—c«)- q(1-— c2). (10.48) 





In our experiments, which will involve q = 4 or q = 3 views, we shall thus 
obtain global confidence levels of c > 9096 or c > 92%, respectively. 


10.4 Numerical Simulations 


Most theoretical results in discrete tomography impose conditions on the 
shape of the objects to be reconstructed. Thus, experiments are usually per- 
formed on connected, convex, or even hv-convex objects [22]. Our algorithm 
does not require such stringent assumptions, and we shall use an original 
32 x 32 binary image 7 displayed in Fig. 10.1. This image features two discon- 
nected components, one of which is nonconvex. As in most of the experiments 
presented in [22], few views will be used, namely q = 4 or q = 3 views. 
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Fig. 10.1. Original image in the last experiment. 


10.4.1 Noise Simulation 


As discussed above, the noise corrupting the views in (10.10) is assumed to be 
Poisson-distributed. The pointwise variance of such a noise is directly related 
to the amplitude of the signal. On the other hand, the data acquisition process 
typically induces a multiplicative factor between the actual line sums and the 
measured views, due, for instance, to exposure time. In order to obtain a 
reasonable noise level, we set this multiplicative factor to 255. 

We now describe the methodology used for creating the noisy views. First, 
for each i € {1,...,q}, the exact line sum is computed and then multiplied 
by the proportionality factor 255 in order to generate the noiseless view LiT. 
Then, each point (LiT |e,) of the ith view is replaced by a realization of a 
Poisson variable with mean (L;x | ex) (see [16, 26] for the numerical simula- 
tion of Poisson noise). 

A typical noisy sinogram view is shown in Fig. 10.2. The SNR (signal-to- 
noise ratio) on the views varies from 31 dB to 33 dB. 





10.4.2 Binarization 


The binarization process consists of mapping an image in [0, 1]" into an image 
in {0,1}. The scheme we adopt here is straightforward: Since the algorithm 
produces an image in the set (10.11), each pixel value is rounded to 0 if the 
original value is less than 0.5, and to 1 otherwise. It is clear that binarization 
could be performed in a more sophisticated fashion, especially in the light of 
additional a priori information on the original image. 


10.4.3 Experimental Setup 


The algorithm described in Section 10.2 is implemented with y: æ  ||z||? in 
(10.3) (hence R is the identity matrix and r is the zero image). In other words, 


220 T.D. Capricelli and P.L. Combettes 


4500 
4000 
3500 
3000 
2500 
2000 
1500 
1000 

500 


0 
0 5 10 15 20 25 


Fig. 10.2. Example of a sinogram view: noiseless view (bold lines) and view with 
Poisson noise (thin lines). 


we seek the image with minimum energy in the feasibility set S = (7: Si. 
The m = q + 4 constraint sets to be used are 


(a) $1: pixel range; see (10.11). 
(b) S2: image support; see (10.12) and Fig. 10.3. 


Fig. 10.3. Support K used in (10.12) in the experiments. 
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(a) (b) 


Fig. 10.4. No noise, q = 4 views, without the total variation constraint. Before (a) 
and after (b) binarization (error: 6 pixels). 


(c) S3: sum of pixel values; see (10.15) and (10.39). 
(d) (Si+3)1<i<q: residual energy of the views; see (10.29) and (10.47). 
(e) S544 = Sm: total variation; see (10.17). 


To illustrate the benefit of using the total variation set Sm, experiments 
with the sets (S;)1«i«m-1 and (Si)1<i<m will be performed. Since they are 
easily computable in closed form, exact projections onto the sets 51, 55, and 
$3 are used. On the other hand, subgradient projections are used for the sets 
(Si)a<i<m- 


10.4.4 Numerical Results 


We first use q = 4 views at observation angles 0, 7/4, 7/2, and 37/4. In the 
case of noiseless views, the algorithm produces the image shown in Fig. 10.4. 

We now turn to the case of noise-corrupted views. Figures 10.5 and 10.6 
show two typical reconstructions produced by the algorithm, for two arbi- 
trary realizations of the noise. To illustrate the impact of the total variation 
constraint, we display the images obtained with and without this constraint. 

In order to test the variability of our results, several hundreds of tests 
were performed, using different realizations of the noise and various types of 
images. These experiments reveal that, for a given image, the number of wrong 
pixels does not vary significantly. This variability is quantified in Fig. 10.7 in 
the case of the original image of Fig. 10.1. 

Finally, we show how the algorithm behaves on a standard image from the 
binary tomography literature. This image, shown in Fig. 10.8, can be recon- 
structed uniquely from its exact (noiseless) horizontal and vertical views [22]. 
This theoretical result is, of course, no longer true in a noisy environment. 
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(c) (d) 


Fig. 10.5. Poisson noise, q = 4 views. Reconstruction without the total variation 
constraint: before (a) and after (b) binarization (error: 16 pixels). Reconstruction 
with the total variation constraint: before (c) and after (d) binarization (error: 8 
pixels). 


However, our algorithm reconstructs the image almost perfectly in the pres- 
ence of q = 3 noise-corrupted views at angles 0, 7/4, and 7/2 (see Fig. 10.9) 
with the 6 sets: 

(a) $1: pixel range; see (10.11). 

(b) S2: sum of pixel values; see (10.15) and (10.39). 

(c) (Si+2)1<i<3: residual energy of the views; see (10.29) and (10.47). 

(d) Ss: total variation; see (10.17). 
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(a) 


(c) 


Fig. 10.6. Same experiments as in Fig. 10.5 with a different realization of the 
noise. Reconstruction without the total variation constraint: before (a) and after (b) 
binarization (error: 12 pixels). Reconstruction with the total variation constraint: 
before (c) and after (d) binarization (error: 9 pixels). 










10.5 Concluding Remarks 


We have proposed a convex programming approach to the discrete tomo- 
graphic image reconstruction problem. Promising results have been obtained 
with a limited number of constraints and a quadratic objective function. The 
proposed algorithm can also handle a wide range of additional constraints as 
well as more general objective functions. The exploration of such extensions 
is left for future work. In particular, additional constraints could be derived 
from a better understanding of the noise process. Likewise, while total vari- 
ation appears to give good results, soft enforcement of binariness should be 
possible through alternative convex functionals. 
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Fig. 10.7. Experiments of Figs. 10.5 and 10.6: histogram of the number of wrong 
pixels based on 2000 realizations of the noise. 


Fig. 10.8. Original image. 
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(a) (b) 


Fig. 10.9. Poisson noise, g = 3 views. Reconstruction before (a) and after (b) 
binarization (error: 1 pixel). 
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Variational Reconstruction with 
DC-Programming 


C. Schnorr, T. Schüle, and S. Weber 


Summary. We present an approach to binary tomography by variational recon- 
struction and difference-of-convex-functions (DC) programming. Because we use a 
standard functional comprising a reconstruction error and a smoothness prior, the 
integer conditions are relaxed to convex box constraints. Complementing the func- 
tional with a concave penalty term allows a gradual enforcement of binary solutions. 
A DC-programming approach leads to an iterative reconstruction algorithm that is 
also applicable to large-scale problems. We show that hidden parameters, which 
model uncertainties of the imaging process, can be estimated as part of the varia- 
tional reconstruction. Besides presenting a concise overview over recent results, we 
also include novel results concerning the optimization performance of our approach. 


11.1 Introduction 


We consider the reconstruction problem of transmission tomography for bi- 
nary objects. As explained in Fig. 11.1, the imaging process is represented by 
the algebraic system of equations 


Ax — b, (11.1) 


where A € R™*” and b € R” are given, and where the binary indicator 
vector x € (0, 11" represents some unknown object to be reconstructed. This 
scenario arises in numerous application areas [14, 15]. 

Problem (11.1) is difficult for two reasons. First, the linear system is often 
underdetermined because, for instance, only a few projection directions are 
taken. In the absence of the binary constraint x € {0,1}", and for a large 
number of variables, this problem is often dealt with by iteratively determining 
a least-squares solution to (11.1) [5, 11, 17, 21], based on the convex feasibility 
problem [2] defined by the m equations (11.1). Another natural approach is to 
regularize the problem by picking out a particular feasible solution having an 
additional desired property. A typical example of such a property that is useful 
in many applications is to require a certain amount of spatial homogeneity of 
reconstructed objects, as measured, for instance, by 
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X1 





i4 











Fig. 11.1. Discretization model for binary transmission tomography. The measured 
projection data are given in terms of a vector b € R™. Each component b; cor- 
responds to a projection ray measuring the absorption along the ray through the 
volume, which is discretized into cells. The absorption a;,j in each cell is assumed 
to be proportional to the density of the unknown object. z1,22,... are binary vari- 
ables indicating whether the corresponding cells belong to the object (2, = 1) or 
not (zz = 0). Assembling all projection rays into a linear system gives Ax = b, 
x € (0, 1I", from which the unknown binary object, represented by x, has to be 
determined. 


S (zi — zj}? =: (x, Lx). (11.2) 


ijeE 


Here, E denote the edges of a regular grid graph, corresponding to a uniform 
discretization of the spatial domain (cf. Fig. 11.1), with nodes i € (1,...,n) 
indexing the components xi, i = 1,...,n, of x. Note that (11.2) defines a 
quadratic form with positive semidefinite matrix L. A prototypical approach 
to restoring underdetermined images or objects, based on (11.1) and (11.2), 
is to compute them as minimum of the functional 


1 
G(x) = z (IIAx — bl? + a Lx)), R 2 o» 0, (11.3) 


yielding a regularized least-squares solution to (11.1) [1, 6, 7]. The weighting 
parameter a allows us to control the smoothness of reconstructions. 

A second source of difficulties related to (11.1) is that we require x to 
be binary. On the one hand, this is motivated by the considerably reduced 
degrees of freedom that have to be determined from (11.1) (as opposed to 
an unconstrained vector x € R”). On the other hand, the feasible set (0, 1)? 
gives rise to the combinatorial integer programming problem 


G(x*)= inf G(x), 11.4 

CE en) (11.4) 

which, in general, cannot be solved in polynomial time. Rather, we have to 

confine ourselves to efficiently compute a good local optimum close to x”. 

Still, this is a challenging task for large-scale problems as they arise in 3D 
tomography. 
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The objective of this chapter is to give a concise account of our recent 
work based on a relaxation of problem (11.4), concave minimization, and 
DC-programming, where “DC” stands for “difference of convex functions.” 
While DC-programming usually refers to algorithms for global optimization of 
a certain class of nonconvex functions [16], which is feasible for small problem 
sizes only, our focus here is on local optimization of large-scale problems. 
Besides summarizing results from previous work [23, 24, 26], we also include 
novel results and experiments with respect to the optimization performance 
of our approach. 

Section 11.2 describes the relaxation and concave minimization part of our 
approach. In Section 11.3, we state conditions for local and global optimal- 
ity. The DC-programming part of our approach is detailed in Section 11.4. 
Our framework is sufficiently general to allow for various extensions. Of par- 
ticular interest is the ability to take into account uncertainty with respect 
to the model of the imaging process. Two examples in Section 11.5 show 
how corresponding parameters can be estimated by extending our approach 
accordingly. Section 11.6 reports experimental results obtained by applying 
our approach to discrete tomography and quantitative information about op- 
timization performance. Although this work has been motivated by binary 
tomography, we believe that our results are also of interest for other scientific 
problems involving large-scale quadratic optimization problems with binary 
variables. We conclude with corresponding remarks in Section 11.7. 

Our notation will be as follows: 






































ixl] Euclidean norm (>> ;_, a2)? of vector x 

(x, y) inner product $77 , vjy; of vectors x and y 
convex set (“box”) [0, 1]": x € B if 0 <a; <1, 
for all 4 

Ip(x) indicator function of the set 




















X 


Ig(x) = 0 if x € B, and Ig(x) = +00 if x ¢ 
Amin(M), Amax(M) smallest and largest eigenvalue of matrix M 








Diag(M) diagonal matrix with the diagonal of matrix M 
Diag(x) diagonal matrix with vector x on its diagonal 
e vector (1,1,...,1)7 € R” 

| Diag(e) (the n x n unit matrix) 

x* global optimum of Problem (11.4) 

Xp local optimum of Problem (11.6) 


11.2 Variational Approach 


Our approach to (11.4) is to relax the binary constraint x € {0,1}” to the 
convex constraint x € B, and to minimize the family of functionals 


Ju (xy) = inf J,(x), (11.5) 














where 


230 C. Schnorr, T. Schüle, and S. Weber 


O<weER, J,(x) := G(x) — uH(x), (11.6) 
G is defined in (11.3), and 
A(x) := 5x —e). (11.7) 


The following theorem explains the connection of problem (11.5) with the 
original problem (11.4). 











Theorem 1. /9, 16] Suppose that G is twice continuously differentiable on 
Then there exists uc € R such that, for all u > te, 





(i) Problem (11.4) and the minimization of J, are equivalent: 


Ju(x*) = inf J, 9). (11.8) 


(ii) J, is concave on [0, 1]". 


Theorem 1 asserts that for a sufficiently large value of the parameter p in 
(11.5), the global optima of (11.6) and (11.4) coincide. Assertion (ii), on the 
other hand, implies that not only does the global optimum satisfy the binary 
constraint in (11.4), but any local optimum too (all vertices of IB are local 
minima). This means that for such a large value u, problem (11.5) is as difficult 
as problem (11.4). 

For u = 0, however, minimization of the functional Jo = G over B is 
a convex optimization problem. Thus, the global optimum xo can be easily 
computed. This suggests the one-parameter family of optimization problems 
(11.6) for an increasing sequence of values 0 = u? < pt < --- « uF, u£ > 
Hc, and the computation of a corresponding sequence of local optima [x,J. 
Because we start with a global optimum xo and use as starting value i in 
order to determine x there is some hope to determine a final local optimum 
p that is close, if not identical, to the global optimum x* in (11.4). This 
approach, akin to numerical continuation or homotopy methods for studying 
general nonlinear systems, is the subject of the present chapter. 

We introduce some further notation in connection with (11.1), (11.3), and 
(11.6), which will be convenient later on: 


























1 
Jax) = Pins Q,x) — (qu, X) + const.; (11.92) 
Q; := ATA + aL — pl; (11.9b) 
qu := ATb — Ze. (11.9c) 


Assumption. Throughout this chapter, we assume the symmetric matrix Qo 
to be positive definite. 

This assumption is natural because the matrix ATA is symmetric and pos- 
itive semidefinite. Moreover, as the nullspace of L in (11.2) contains constant 
grid functions only, its intersection with the nullspace of ATA is typically 
empty. 
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Corollary 1. The value uc of the parameter u in Theorem 1 is 


Hc = Amax(Qo). (11.10) 


11.3 Optimality 


Problem (11.5) belongs to the class of quadratic optimization problems [8]. 
While the objective function is nonconvex for u > Amin(Qo), the constraints 
x € B are convex and therefore simple. 

In Subsection 11.3.1, we state conditions for local optimality. These are 
important for the design of optimization algorithms, because they have to 
be satisfied after convergence. In Subsection 11.3.2, we briefly address the 
much more difficult problem of global optimality. We confine ourselves to 
reporting from the literature necessary and sufficient conditions that can easily 
be checked in practice. T'hey provide the basis for corresponding experimental 
results discussed in Subsection 11.6.2. 














11.3.1 Local Optimality 











Minimizing the functional (11.6) over the convex set B leads to the first-order 
necessary conditions for a local optimum x,,. 





























-VJ (xp) € Na(x,), (11.11) 
where Ng(x,,) is the normal cone to B at x, [22]. Due to the simple structure 
of B = [0, 1]"^, condition (11.11) reads more explicitly: 

<0 if(x,); — 1, 
(V4.6), 4 20 if (Kp): = 0, (11.12) 
=0 if(x,)i € (0,1). 


This condition is sufficient in the convex case u < Amin (Qo), and also in the 
nonconvex case if x,, is nondegenerate [8]. The latter means that no component 
of the gradient (VJu(x,)); vanishes if a corresponding constraint is active: 
Ti E {0, 1}. 


11.3.2 Global Optimality 


The following conditions, to be read elementwise, can be numerically checked 
in practice: 


Theorem 2. /3] If x, € {0,1}" and 


Fmi (Qu)e > (Diaz) - 51) (Qux, ae), (1113) 
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then x,, = x*. Conversely, let x, = x*. Then 


zDiag(Q,)e > (Diag(x,) — 31) Qux, — qp). (11.14) 


Condition (11.13) is sufficient for x, to be globally optimal. It can be used 
to check a posteriori whether a local minimum x,, is also a global minimum. 
Conversely, (11.14) is a necessary condition for global optimality. Thus, if x, 
violates (11.14), then it cannot be the global optimum. 


11.4 DC-Programming 


In this section, a series of algorithms is developed for solving (11.8) and (11.4). 
Each algorithm utilizes the family of optimization problems (11.5) together 
with a particular decomposition of the functional (11.6). 

Subsection 11.4.1 describes the overall iterative algorithm based on (11.5). 
At each iteration, a quadratic functional is locally optimized by DC-pro- 
gramming. Subsection 11.4.2 describes the DC-framework and two particu- 
lar instances related to (11.6). Results of a numerical evaluation of these two 
approaches are reported in Section 11.6. 


11.4.1 Overall Algorithm 


The following algorithm computes a locally optimal solution to (11.8). 


Algorithm 1 Compute a local optimum to (11.8). 

Require: 0 € u < Amin(Qo) 

Require: ó, > 0 {y-increment} 

Require: £ > 0 {termination criterion: x is approximately binary} 


























1: x € global minimum of J, = G in B (see (11.3), (11.6)} 
2: repeat 

3: u-«€—pucó, 

4: x< local minimum of J, in B (see (11.6)} 

5: until zi Z [e,1— e], for alli=1,...,n 

6: return x 


Note that the first computation of x is done by computing the global minimum 
of a convex quadratic functional. For increasing values of u, however, the 
optimization problem in line 1 becomes nonconvex. Algorithms for solving this 
problem are described in Subsection 11.4.2. Algorithm 1 terminates according 
to Theorem 1. 

In Subsection 11.6.2, we numerically assess the optimization performance 
of Algorithm 1. If it returns the global optimum, then this is also the solution 
x* of (11.4). 
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To determine a starting value of u, we compute Amin(Qo). Because Qo is 
positive definite and the computation can be done offline, we apply the power 
iteration [10] to the matrix Qj !. The inverse does not have to be computed 
explicitly. Just a stable linear system with Qo as matrix has to be solved in 
each step. For large-scale problems, this also can be done iteratively [12]. 

Determination of a value for 6, is based on the following consideration. 
Having computed the global optimum of J„ in line 1 of Algorithm 1, the 
components x; of x lying in the interior of B are assumed to be most critical 
for computing a good local minimum as y varies. Consequently, we determine 
Ô, so as to bound the initial “speed” of these values. 

To simplify the analysis, we consider the situation in which x is in the 
interior of B, and that u = 0 in line 1. Then (11.11) reduces to VJo(x) = 0. 
Because Qo is positive definite, we have 


x = Qz !qo. (11.15) 


Let x’ = x+ ôx be the first local minimum, computed in line 1, corresponding 
to the parameter with u’ = 0+6, = ô„. Our objective is to bound the shift ôx. 
To this end, we ignore again the constraints B and assume Q,, to be invertible 
as well: 






































x! = Qqy. (11.16) 
Using the first-order expansion in ô, (see the Appendix), we obtain 
-1 1 
lôxl] < ó,lIQo Illl- Sell. (11.17) 
We propose choosing 6, such that |óz;| < €, on average. This yields 
1/2) ; 
5, < min) (11.18) 
Ik - zel 


This result is consistent with the above assumption that Q, = Qs, is invert- 
ible provided 6, < Amin(Qo). As a rule, this is true in real applications, in 
which x sufficiently deviates from e. The analysis does not apply for arbi- 
trary values of u, however, because Q, becomes singular if u is equal to some 
eigenvalue of Qo. Still, the result is plausible because the actual computation 
corresponding to line 1 — see Subsection 11.4.2 — avoids any singularities 
and is based on convex optimization of functionals with quadratic forms in 
Qo, independent of the value of u. 


11.4.2 DC-Optimization 


In view of functional (11.6) and problem (11.8), we consider the family of 
optimization problems 


inf {g(x) - h(x)). (11.19) 


where both g and h are convex and continuously differentiable. The following 
proposition follows from general results described in [20, 23]. 
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Proposition 1. The sequence (x^) of global optima of the convex optimiza- 
tion problems defined by 


x"*! € argmin [G9 — (Wh(x*),x) } (11.20) 
xcB 

is decreasing and converges to a point x satisfying the necessary conditions 

for a local minimum of (11.19): 


Vh(x) € Vg(x) + Na(X). (11.21) 


Consequently, Algorithm 1 gives rise to the following: 


Algorithm 2 Compute a local optimum to (11.8). 
Require: 0 € u < Amin(Qo) 
Require: ó, > 0 {y-increment} 
Require: £ı > 0 {termination criterion: x is approximately binary} 
Require: £2 > 0 {termination criterion of the DC-iteration} 

1: x € global minimum of J, = G in B (see (11.3), (11.6)} 

2: repeat 

3: u-«e€npucó, 

4 repeat 

5 x =x 
6: x <= global minimum of g(x) — (Vh(x’), x) in B {see (11.20)} 
7: until (g(x') — h(x’)) — (g(x) — h(x)) € ez 
8: 
9: 





























until x; ¢ [£1,1 — £1] , for all i = 1,...,n 
return x 


Depending on the decomposition of (11.6), Algorithm 2 can be applied to 
(11.8) in different ways. We consider the following two alternatives. 


I. Choose R 3 À > Amax(Q,,) and 


A A 
g(x) := 5 Ixl? » A(x) := zixl? - Iu(x). (11.22) 
Based on this definition, (11.20) corresponds to the sequence of convex 
programs 
1 
x^^! € argmin [ol e Value) Y (11.23) 
xcB 


II. Choose [cf. (11.6)] 
g(x) := G(x), h(x) := uH(x). (11.24) 
In this case, (11.20) corresponds to 


x^ n [669 — nV H(x^),x) V. (11.25) 
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After convergence, either version satisfies (11.21), which then corresponds to 
(11.11). 

Decomposition (11.22) leads to a straightforward optimization algorithm 
based on sparse matrix-vector products and thresholding [23], but with slower 
convergence. Decomposition (11.24), on the other hand, leads to more in- 
volved iterative steps (11.25) that are still efficiently solvable [4], and with 
faster convergence. Other decompositions are conceivable but have not been 
investigated so far. 


11.5 Extensions 


In real applications, the mathematical model (11.1) of the imaging process is 
often not precisely known. As such uncertainties may considerably affect the 
reconstructions, they have to be taken into account whenever possible. This 
can be done if a parametric model of the uncertainties is known. 

An example is given by [24] 


Ax + voA(e — x) = b, (11.26) 


which generalizes (11.1) to include two unknown absorption coefficients. Other 
examples are [26] 
AG,x = b, (11.27) 


and 
Gc Ax = b, (11.28) 


which take into account that projections were taken from blurred objects 
or that the projection data themselves were blurred, respectively. Here, Go 
denotes a block-circulant matrix representing the blurring operation with a 
Gaussian and unknown scale parameter c. 

In either case, Algorithm 2 can be extended so as to incorporate parameter 
estimation, during reconstruction, by means of the well-known expectation- 
maximization (EM) approach [18]. While the actual computations for setting 
up the respective extension are application-dependent [24, 26], the general 
principle is the same: Function g(x) in (11.20) is modified through integration 
over the space of unknown parameters (i.e., marginalization in the language of 
probabilistic inference), to become a convex function g(x; X) (E-step). Here, 
X denotes the reconstruction based on the current parameter estimate. Both 
parameters and reconstruction are iteratively updated, as part of the DC-loop 
(M-step). Algorithm 3 provides these steps in detail. 


11.6 Experimental Results 


In Subsection 11.6.1, we illustrate our approach by applying it to binary dis- 
crete tomography. Performance from the viewpoint of optimization is assessed 
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Algorithm 3 Locally optimize (11.8), estimate unknown parameters. 
Require: 0 € u < Amin(Qo) 
Require: ó, > 0 {y-increment} 
Require: £ı > 0 {termination criterion: x is approximately binary} 
Require: £2 > 0 {termination criterion of the DC-iteration} 
Require: £3 > 0 {termination criterion of the EM-iteration} 
x < global minimum of J, = G in B {see (11.3), (11.6)} 
repeat 
wept ou 
repeat 
x =x 
repeat 
xX=x 
x < global minimum of g(x; X) — (Vh(x’),x) in B {EM-update} 
until ||x — x|| € £3 
10: until (g(x') — h(x’)) — (g(x) — h(x)) € e 
11: until zi £ [e1,1 — ei], for all i = 1,...,n 
12: return x 


























e 





in Subsection 11.6.2. Finally, we illustrate in Subsection 11.6.3 the extensions 
described in Section 11.5. 


11.6.1 Discrete Tomography 


To illustrate Algorithm 2 with Decomposition I (11.22), Fig. 11.2(a) shows 
a 2D image from which three projections within 90 degrees were taken. The 
other images in Fig. 11.2 show intermediate results of the iteration and how the 
binary constraint is gradually enforced. The final binary result [Fig. 11.2(f)] 
shows that although the initial solution of the convex problem for = 0 is 
far from the original image, the algorithm is able to reconstruct the original 
image without error. 

To illustrate the application of our approach to a sizable 3D reconstruction 
problem, we used a binarized medical data set of size 256°. This volume was 
calculated as the difference of two volumes based on a standard computer to- 
mography reconstruction. The first volume was the reconstruction of a human 
head without, and the other volume with a contrast agent. As a result, the 
binarized difference volume shows the tree structure of a vessel system inside 
a human head. 

For this binary volume, we simulated 5 projections with angles 31.5°, 48°, 
66°, 84°, and 103.5°. The reconstruction was calculated with Decomposition 
II (11.24). The parameter a was set to 1, and the increment 6,, was computed 
according to Eq. (11.18) with e = 1. We eliminated beforehand from the opti- 
mization procedure all redundant voxels with projection value equal to zero by 
setting them to the fixed value zero. The remaining number of unknowns was 
279,074, and the number of projection constraints was 485, 791 [Fig. 11.3(a)]. 


11 Variational Reconstruction with DC-Programming 237 






(a) Original 64x64 image (b) Result for u = 0 


(d) u = 0.06 (e) u = 0.11 (£) Result after termina- 
tion 

Fig. 11.2. (a) Original image (64 x 64). (b)-(f) Reconstructions from three projec- 

tions (0°, 45°, 90°). Each image shows the reconstruction result after termination 

of the DC-iteration (cf. Algorithm 2, lines 4-7) for a corresponding value of y. 


While this relationship of unknowns/constraints seems to indicate that the 
reconstruction problem is easy, this is essentially not the case. Even standard 
tomographic reconstruction employs regularization (filtered backprojection) 
while using much more constraints — 133 projections over 200° in the present 
example. Consequently, regularization is all the more crucial when taking five 
projections only. This is effectively done by our approach by reducing the 
degrees of freedom of the reconstruction to those of binary volume functions. 

The algorithm was able to reconstruct the ground truth volume [Fig. 11.3 
(b)] within a run time of about 1900 sec on a dual 2.4GHz AMD Opteron 
system. 


11.6.2 Optimization Performance 


We assessed the optimization performance of our approach using a 64 x 64 
image shown in Fig. 11.4, from which three projections were taken at 0°, 
45°, and 90°. We denote the corresponding vector with x*, and the final 
reconstruction x,,,,, With x = x,,,,,. The regularization parameter a was set 
to 0.5. The -increment ô, was computed using Eq. (11.18) with £ = 1. 
Note that in this subsection, we use the symbol x* to denote the ground 
truth depicted in Fig. 11.4. We do not know, of course, whether this also 
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(a) Peel volume (b) Reconstructed vessel system 


Fig. 11.3. Variational binary reconstruction of a vessel system in a discretized 
volume of size 256?. (a) Locations of unknown variables corresponding to nonzero 
projection constraints. (b) The reconstructed vessel system. 





Fig. 11.4. Ground truth of a 64 x 64 reconstruction problem with three projections 
within 90?. 


corresponds to the global optimum. It is reasonable to assume, however, that 
it is quite close to it. Our results shown below confirm this and thus yield useful 
quantitative insights into the optimization performance of our approach. We 
preferred this way over experiments of very small size, for which the global 
optimum can be computed as a reference by exhaustive search. 

Figure 11.5 displays the error ||Ax — bļ||2, and Fig. 11.6 shows the corre- 
sponding run times, as a function of a factor s > 1 specifying various coarser 
schedules in terms of a p-increment sd, that was larger than specified by 
Eq. (11.18), and that was fixed for each schedule. Both decompositions were 
able to reconstruct x* for fine schedules. This also confirms the validity of the 
estimate (11.18), which seems to be a bit conservative but safe. For coarser 
schedules the run time decreases, of course, but the error increases as well. 
Moreover, Decomposition II converges faster, that is, a smaller number of it- 
erations necessary to converge compensates by far the additional costs in each 
step, in comparison to Decomposition I. 
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Fig. 11.5. Error ||Ax — b|[2 for different schedules. 
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Fig. 11.6. Run times for different schedules. 


11.6.3 Extensions 


Figures 11.7 and 11.8 show results computed with Algorithm 3 for the joint 
reconstruction-parameter estimation problems (11.27) and (11.28), respec- 
tively. These results prove the stability of our approach and enables us to 
cope with such inverse problems. 


11.7 Conclusion 


We presented an overview over recent results obtained with a novel approach 
to binary tomography. Ingredients of this approach are (1) a least-squares 
error criterion for reconstruction, (2) a variational prior enforcing spatially 
homogeneous solution, (3) relaxation of 0/1-constraints and adding a concave 
penalty term, and (4) iterative optimization by DC-programming. The ap- 
proach is essentially free of tuning parameters and applicable to large-scale 
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(b) Without deblur- (c) Without deblur- 


(a) Original ring (5 proj.; c = 1.0) ring (5 proj.; e = 2.0) 























(d) With deblurring (e) With deblurring (£) With deblurring 
(3 proj.; e = 0.5) (5 proj.; e = 1.0) (5 proj.; o = 2.0) 


Fig. 11.7. Reconstruction from blurred objects. (a) Original image (32 x 32) con- 
volved with a Gaussian kernel (c € [1.0,2.0)) and projected along five directions 
(0°, 22.5°, 45°, 67.5°, 90°). (b) and (c): Reconstructions without deblurring. (e) and 
(f): Reconstructions with deblurring. (d) Reconstruction from three projections only 
(0°, 45°, and 90°). Parameter o = 0.01 in all experiments. 


3D reconstruction problems. Experimental results demonstrate promising op- 
timization performance. The reconstruction process can be extended to in- 
clude the estimation of parametric models of the imaging process. 

Our further work will focus on various applications of binary tomography, 
and on the inclusion of prior knowledge about object structure beyond the 
elementary property “spatial smoothness.” 
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Appendix: Derivation of (11.17) 
Applying Woodbury's formula [10], we compute 


y = (Qo — ul)? -Q; 5,05 (5,057) 0129) 
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(d) Blurred projections (c = 1.5) 
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(c) Reconstruction 
(c = 1.5, a = 0.05) 
































(e) Reconstruction (f) Without deblurring 
(c = 1.0,a = 0.01) (c = 1.0,a = 0.01) 


Fig. 11.8. Reconstruction from blurred projections. The image shown in (a) was 
projected along 0°, 45°, 90°, 135? and convolved with a Gaussian kernel (o = 1.5). 
(b) and (d): Correct and blurred projection data, respectively. (c) Reconstruction 
from the data shown in (d), a = 0.05. (e) Reconstruction from projection data that 
were blurred with ø = 1.0. The accuracy of estimating the unknown parameter c 
during the reconstruction is about 696. (f) Erroneous reconstruction without taking 
deblurring into account. 


Because à,, < Amin(Qo) — see (11.18) and the corresponding discussion — we 
have ||9,Qg ! || = ó,||Qg "|| < 1 and, as a consequence, the valid representation 
(Neumann series) 
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Qu = Qu + Su Qo (Saa ») Qj; ~ Qog (l+ SQD t). (11.30) 


k=0 





Insertion into (11.16) and ignoring the quadratic term in 6,, give, using (11.15), 
~o-! zi Õu -1 1 
x + 6x & Qg (l+ 5,Qq")(do - 7e) ~ x + 6,Qy (x — 7e). (11.31) 


Taking the norm and the Cauchy-Schwarz inequality result in (11.17). 
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Part III 


Applications of Discrete Tomography 


12 


Direct Image Reconstruction-Segmentation, as 
Motivated by Electron Microscopy 


H.Y. Liao and G.T. Herman 


Summary. Our aim is to produce a tessellation of space into small voxels and, 
based on only a few tomographic projections of an object, assign to each voxel a label 
that indicates one of the components of interest constituting the object. Examples of 
application are in the areas of electron microscopy, industrial nondestructive testing, 
cardiac imaging, etc. 

Current approaches first reconstruct the density distribution from the projec- 
tions and then segment (label) this distribution. We instead postulate a low-level 
prior knowledge regarding the underlying distribution of label images and then di- 
rectly estimate the label image based on the prior and the projections. In particular, 
we show, in the binary (i.e., two labels) case, that the marginal posterior mode esti- 
mator outperforms the widely known maximum a posteriori probability estimator. 

As measured by label misclassification in the reconstructions, our direct labeling 
method was experimentally proved (in the binary case) to be superior to current ap- 
proaches. However, when a detectability measure was used, its relative performance 
was less satisfactory. We discuss possible improvements. 


12.1 Introduction 


The essence of discrete tomography is to make use of the knowledge that 
the reconstruction should contain only a few values to make up for the lack 
of availability of a large number of projections [25]. For example, in indus- 
trial nondestructive testing, each voxel in the reconstruction can be labeled 
as containing either (the presence of) the scanned object or air; the number 
of projections should be small for containment of cost. Another example is in 
electron microscopy of biological macromolecules, in which the labels (com- 
ponents) could be, e.g., ice, protein, and RNA; and because of the damage by 
radiation, only a very limited dose can be applied to the specimen. 
Traditionally, in all applications that use a discrete tomography method, 
a label is determined by the density within the corresponding component. 
Under this assumption, partial volume effects along the boundaries between 
components are ignored. Examples of application areas are cardiac imaging 
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Fig. 12.1. Histograms of the densities corresponding to volumes sampled using 
voxels of edge length equal to 2.5 (left) and 7.5 (right), obtained from volumes 
composed only of ice, only protein, or only RNA; from [7]. 


[8, 11, 41], angiography [14], attenuation map correction [1, 2, 11, 13], etc. 
In contrast to these applications and in addition to the partial volume effect, 
in electron microscopy the range of densities that may occur in a voxel may 
overlap for the different labels that may be assigned to that voxel. As the 
resolution increases, this overlap of densities becomes higher, as indicated by 
the histograms of densities in Fig. 12.1. 

'The understanding of the macromolecular mechanisms of the key cell func- 
tions has great impacts; for example, it may be essential for the discovery of 
a new drug. Since these mechanisms are highly correlated with the three- 
dimensional (3D) structure, an accurate reconstruction of the 3D volume is 
vital for the inference of the biological functions [16, 30]. To improve on ex- 
isting reconstruction methods that ignore the phenomenon indicated on the 
left of Fig. 12.1, we propose new techniques directly applicable to the case of 
overlapping density ranges. 

We postulate that Gibbs priors [48] enable us to model the underlying dis- 
tribution of label images. Such priors, which allow us to “encode” the general 
shapes and sizes of characteristic structures [33, 37], need to be learned from 
a training set of typical correctly labeled images [33]. Our approaches are sta- 
tistical: The reconstruction is considered to be a minimizer of a Bayes risk. 
A popular choice for the optimizer is the well-known maximum a posteriori 
probability (MAP) estimator [48]. However, we also study the marginal pos- 
terior mode (MPM) estimator, as it has been suggested [15] that, for image 
denoising purposes, it would not cause “oversmoothing,” as would the MAP 
estimator. Here we show experimentally that this is also the case in image 
reconstruction. 

In Section 12.2, we introduce our statistical model, and in Section 12.3, we 
define the two estimators (MAP and MPM). Because of the complicated form 
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of the true estimators, in Section 12.4, we propose approximations to both of 
them, and in Section 12.5, we introduce our algorithms that search for the 
approximated estimators. All the experimental results, shown in Section 12.6, 
are demonstrated for the binary (i.e., two labels) case for 2D and 3D images. 
A methodology for objective comparison of algorithms is also treated. We 
make some discussions and give the conclusions in the last section. 


12.2 The Model 


Based on a Bayesian viewpoint, we combine the information coming from the 
measurement data with the Gibbs prior and then minimize a Bayes risk that 
is defined to be the expectation of some loss function with respect to the 
posterior probability. The posterior probability is proportional to the product 
of the (Gibbs) prior probability and the likelihood that is defined to be the 
conditional probability of the measurement data given the label image. The 
optimizer of the Bayes risk (or Bayes estimator) is adopted as the solution 
of the reconstruction problem. Our model is hierarchical: At the bottom are 
the label images, which are assumed to obey a Gibbs probability distribution; 
on top of the label images, we have the gray-value (or density) images, which 
model the uncertainty of the density at each voxel. Finally, on top of the gray- 
value images are the measurements, which are defined to be a noisy version of 
the projections of a gray-value image. Our task is to estimate the label image 
from the measurements. 


12.2.1 The Label Image 


For simplicity in the discussions, we concentrate for now on two-dimensional 
(2D) label images. Later, in Section 12.6, we consider 3D images. Let D be the 
domain, which is defined to be a fixed nonempty square subset of the square 
lattice (i.e., 

D = {(v1,v2) E Z? | 0€ vv; <V}, (12.1) 


where Z denotes the set of all integers). An element d € D is called a point in 
D. Any nonempty subset of D is called a clique. Given a clique q, a con- 
figuration (over q) is defined as a mapping from q into the set of labels 
A = (A,22,..., Ag}. In the special case where q = D, a configuration 
9g: D — A is called a label image and will be denoted by x. Sometimes, 
for computational purposes, it is convenient to express images as vectors. 
Thus, we allow x to denote either an image with domain D or a column 
vector of dimension |D| = V?; i.e., the elements of the corresponding vector 
TES (z0. 2D) are defined by xy,v4v, = (v1, v2), 0 € v1, v2 < V. 
Since the Voronoi neighborhood [23] on the plane of a point in Z? is a unit 
square (commonly referred to as a pizel), we also call a label image a labeling 
of the set of pixels corresponding to D. 


250 H.Y. Liao and G.T. Herman 


We assume that there is a prior distribution that assigns to every label 
image x € A? a probability 7 (x). Typically, this prior distribution is a Gibbs 
distribution (GD), which means that 


n(x) = Z~ exp[—H (x)], (12.2) 


where 7 (a) is the probability of occurrence of the image x, Z is the normal- 
izing factor, and H (x) is referred to as the energy of x (see, e.g., [48]). The 
energy is the negative of sum of local potentials, each of which is a real num- 
ber assigned to the configuration on a clique. As a result, m (x) is a product 
of terms, each of which depends on only one clique. Typically, the local po- 
tential depends on the labels in the cliques in a way that is invariant under 
various transformations (such as translations, rotations, and reflections); this 
results in having only a few possible values for the local potentials, which 
are the parameters of the GD. In the examples with 2D images, we consider 
two labels [ie., | A] = 2, and the labels are black (or 0) and white (or 1)] 
and use a GD that has five parameters. Each parameter corresponds to 3 x 3 
clique configurations that are considered to contain one of the following five 
“local features": a black region, a white region, an edge, a convex corner, or 
a concave corner. (See Fig. 12.2. More details can be found in [33].) To see 
how the general shapes and sizes of characteristic structures in images can 
be “encoded” by these parameters, Fig. 12.3 shows typical random samples 
from four GDs, whose parameters were conveniently adjusted for illustration 
purposes. 

For electron microscopy applications, we can think of ice as corresponding 
to black and protein to white. The nature of the boundaries between ice an 
protein is modeled by the local features. To obtain the parameters of the GD, 
there are standard techniques for estimating them from typically correctly 
segmented images (see, e.g., [31] for a survey). In [33], we applied the best- 
known techniques to our models with the five parameters. Here we use the 
estimation method we found to be the best (the pseudo-likelihood method 
[3, 4]) in these experiments. 


edge edge 


black convex convex convex concave concave concave 
region comer comer comer comer comer comer 

white concave concave concave edge edge convex convex convex 
region comer comer comer comer comer comer 


Fig. 12.2. Configurations of a 3 x 3 clique that specify local features referred to as: 
à black region, a convex corner, a concave corner, an edge, and a white region. 
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Fig. 12.3. Sample images (63 x 63) from four different Gibbs distributions modeled 
by five parameters, each one of which is assigned to a local feature of Fig. 12.2. 


12.2.2 The Gray-Value Image 


Let Y be a set of gray-value images, each of which is an /-dimensional vector 
y = (yo,---;Yyr—1)*, where y; € Y (the set of gray values, Y C R), for 0 < 
i < I — 1. We assume that there is a conditional distribution that, given a 
label image x, assigns a probability ¢(y|x) to every gray-value image y. [Since 
Y is not necessarily finite, it would be more precise to say that ¢(y|x) is 
the probability density function defining the conditional distribution of the 
gray value image y given the label image x. For the sake of brevity, we will 
continue to refer to a notation such as ¢(y|x) as a “probability” rather than as 
a "probability density function."] Unless otherwise stated, we work with the 
special case in which J = |D| and, for every label z;, there is a distribution 
that assigns (independently) a probability y(y;|x;) to every gray-value y;, for 
0 € i € I — 1. Consequently, 


I—1 
(yl) = TT evil) (12.3) 
1=0 


For example, in electron microscopy applications, a gray value y; is the 
density of the Voronoi neighborhood of the lattice point i, for 0 < i < I— 1; 
and the probabilities y(y;|x;) intend to model the histograms in Fig. 12.1; 
ie., the uncertainty of the density around the average density, denoted by 
[bz,, Corresponding to the label z;. 


12.2.3 The Measurements 


Let W be a set of measurement vectors, each of which is a J-dimensional 
column vector w = (wo,...,wj-1)', where wj c W CR, fon0x j € J- I. 
We assume that there is a conditional distribution that, given a gray-value 
image y, assigns a probability x(w|y) to every measurement vector w. Usually, 
w is related to a linear transformation z of the gray-value image 


z= Ry, (12.4) 
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where R is of size J x I with fixed coefficients and none of the rows of R is a 
null vector. Here two cases are studied: one in which R is the identity matrix 
(thus, J = I) and the other one in which R is the projection matriz [21]. 

The first case is based on current approaches of first reconstructing the 
gray-value image followed by a segmentation. If the gray-value reconstruction 
were perfect; i.e., the measurement is the gray-value image itself (w = y), then 
the resulted segmented image would be the best one can expect from current 
techniques. This idealistic situation provides us with an “upper bound" on 
the quality of the label reconstruction by a current approach. 

'The second case reflects real tomography applications, in which the mea- 
surement data are a set of projections, each of which is usually modeled as 
the measured (hence, noisy) integrals of the density of the object being re- 
constructed along lines that either are parallel to each other or share a point. 
In our framework, this can be modeled by setting the entry rji of R to be the 
length of intersection between the line 7 and the Voronoi neighborhood of the 
point i. 

For the experiments reported here, we work with the special case in which 
x(w|y) has a product form; i.e., given a gray-value image y, there is (for 
0 € j € J — 1) a conditional probability v;(wj|y) of the jth measurement 
being w;, and 


J-1 
x(wly) = T ey (wyly)- (12.5) 
j=0 


12.2.4 The Posterior Probability 


The likelihood 7(w|x) of measuring w given a label image x, assuming (as 
reasonable) that w does not depend on z given y, is the integral (or sum if Y 
is discrete) 


n(w|at) = J x wly) (yla)dy. (12.6) 


By invoking Bayes’ rule, the posterior probability 0(z|w) of x given the mea- 
surement w is proportional to the product of the prior probability m(x) and 
the likelihood n(w|x): 


6(z|w) = MT e n (a)n(wl|x). (12.7) 


12.3 Optimization Criteria: The MAP and the MPM 
Estimators 


Because the exact or perfect solution rarely exists, inexact but optimal (in 
some sense) solutions are usually sought instead. Since the model for the 
prior as well as the model for likelihood of the data are available to us by 
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assumption, it seems natural to seek a solution from the Bayesian point of 
view. In Bayesian estimation theory, estimators are studied in terms of loss 
functions [48]. 

For a given measurement w, let z(w) denote an estimator that hopefully is 
“close” to the unknown zx. The loss of estimating a true x by ? is measured by 
asymmetric “distance” or loss function C (x, 2) > 0 with the convention that 
C (z,z) = 0. Given a loss function C (x, $), a posterior probability 6(z|w), 
and a measurement vector w, the Bayes risk is defined as 


RC, 3, w) = XO C (s, 8(w)) 6(a|w). (12.8) 


xEAP 


An estimator (w) that minimizes this risk is known as a Bayes estimator. 
Two much-used loss functions are the all-or-none loss function [Co (x, $), 
which equals 0 if 2 = x and is 1 otherwise], and the Hamming loss function 
[Co (x, 2), which is the number of points at which 2 and z differ]. The former 
gives rise to the popular maximum a posteriori probability (MAP) estimator 
z^ ^P (w) = arg max [0(x|w)] , (12.9) 

z€AD 

whereas the latter corresponds to the marginal posterior mode (MPM) esti- 
mator, whose ith element is the label that maximizes the marginal posterior 


Tj 


MPM — arg max (2; |w), (12.10) 
ric 


for i—0,...,[|D| - 1. 


12.4 Approximation to the Posterior Probability 


'Two challenges concern the estimation. First, computing the exact MAP or 
the MPM estimators implies, respectively, the finding of the maximum of the 
posterior probability or its marginal. Unfortunately, closed and explicit forms 
for these optima do not exist and neither do efficient deterministic algorithms 
for finding them. In such a situation, Markov Chain Monte Carlo (MCMC) 
[6] methods might be the only feasible recourse. In particular, for the MAP 
estimator, because of the nonlinearity and the nonconvexity of the posterior 
probability, the optimum cannot be obtained via local search techniques, un- 
less a suboptimal estimator is sought. We found that a local method, such 
as the one known as iterated conditional mode [5], yields results that are not 
nearly as good as those provided by our approaches. As for other local meth- 
ods, such as relaxation labeling [42], it has been suggested in [31] that it often 
produces solutions with risk that is well above the attainable minimal risk. As 
a consequence, we restrict our attention to global methods. In addition to non- 
convexity, the posterior (probability) is defined on discrete labels; therefore, 
a combinatorial optimization technique needs to be used. 
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The second challenge has to do with the fact that the complicated nature of 
the posterior (specifically, the likelihood term involves a multidimensional in- 
tegration) does not make possible an efficient sampling in the MCMC method; 
or at least we are not aware of such. Therefore, we investigated the existence 
of alternative approaches that not only can be efficiently implemented but also 
deliver good reconstructions. We propose the pseudo-likelihood (PL) approxi- 
mation. We also investigated the mean-by-the-mode likelihood approximation, 
but we do not show its results because they are not better than those produced 
by PL when noise is high (which is likely to be the case in our applications). 
It is, however, a good approximation only when the noise is low [32]. 

Prior to introducing the PL approximation, we study the case when the 
probabilities involved are normally distributed, because then there is a closed 
form for the likelihood, and the approximation error can be easily calculated. 


12.4.1 Normality Assumption 


In practice, it is not unreasonable to assume normally distributed gray values 
and measurements. Thus, in all the discussions (although some of them do 
not require this assumption) and the experiments, the probabilities ¢(y|x) in 
(12.3) and x(w|y) in (12.5) are restricted to be normally distributed. 

Let ó(y|v) be a normal distribution with mean vector 4, and covariance 
matrix Xz; i.e., 


é(y|x) = Normal (uz, Xz) (12.11) 


and Y = R. We work with the special case in which X, = diagg<j<y_1(07,) 
is positive definite. We assume that the measurement is a (Gaussian) noisy 
linear transformation, as in (12.4), of the gray-value image; i.e., 


x(w|y) = Normal (Ry, Xw), (12.12) 


where Xw is the covariance matrix of the noise; we consider the case in which 
Xw = diago<j< 1 (09) is also positive definite. 

It is convenient to view the gray-value image y and the measurement 
vector w as instances, respectively, of the associated random vectors y and 
w. Accordingly, (12.11) implies that 


y ~ Normal (us, Xx), (12.13) 


and (12.12) implies that 
w= Ry +n, (12.14) 


where n is the random vector associated with the measurement noise; i.e., 
n ^ Normal (0, Xw). 
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A result in probability theory [27] establishes that Ry is a normally dis- 
tributed random vector with mean Ru, and covariance RX, RI. Knowing this 
and assuming (as reasonable) that Ry and n are uncorrelated, we have the 
probability law for the likelihood 


n(w|x) = Normal (Ruz, RE,R' + X). (12.15) 


More explicitly, if we set pw = Ruz and X, = RX, R' + Xu, then 


nola) = L pgex-i(w-gufEXuiw- au), — 026) 


(2x) 15,4] 2 


where Xwg is also positive definite, because so are X, and X, by assumption. 


12.4.2 The Pseudo-Likelihood (PL) Approximation 


We approximate the likelihood in (12.6) by the pseudo-likelihood (PL) that is 
defined as 


J—1 
nez Qw|z) = Lila); (12.17) 
j=0 


where ¢;(w,|x), for j = 0,..., J—1, is the likelihood (probability) of observing 
wj given z; i.e., it is the marginal 


cj (w;|x) = / n(w'|x)dw', (12.18) 


j 


where 


W; = {w | w; = wz} (12.19) 


[the integral in (12.18) is replaced by a sum if W is discrete]. The pseudo- 
posterior 0pr (x|w) is obtained by replacing n(w|x) by npr (wlz) in (12.7). The 
introduction of the PL in (12.9) and (12.10) by replacing n(w|x) by npr(wl|x) 
in (12.7) gives rise to, respectively, the pseudo-MAP (P-MAP) estimator 


a? MAP — arg max pr (v|w), (12.20) 


and the pseudo-MPM (P-MPM) estimator 


QM = arg max pz (z;|w), (12.21) 


for i —0,...,1— 1. 

The PL approximation is justified when, e.g., the measurements are very 
noisy. It assumes they are statistically independent. This is valid in many 
tomography problem settings [24, 45, 46], but unfortunately it is not so in 


256 H.Y. Liao and G.T. Herman 


our framework, because of the dependencies coupled by the gray values. Intu- 
itively, however, it is reasonable to argue that these dependencies are rather 
“weak” when noise is high. We now establish a relationship between the true 
likelihood and the PL under the normality assumption. 
In this case, the marginal distributions c;(w;|x) in (12.18) becomes, noting 
(12.15), 
€ (wj|x) = Normal (Hw; 0 br;) ; (12.22) 


where, for 0 € j < J — 1, pw, is the jth element of jj, = Ry, and opr; = 
rir + 05 0, with r; being the transpose of the jth row of R. Hence, 
the PL becomes 

npi(w|xz) = Normal (tw, XPL), (12.23) 


where we have set Xpr, = diagoz j« jy 1(02r;)- A closer look tells us that X pr, 
is, in fact, the diagonal part of Xwx in (12.16); thus, we can set X; = Xpr—X, 
where the entries of R are the off-diagonal elements of 3,4 but with negative 
sign. If npr(w|x) were a good approximation to 7(w|x), then Xpz should be 
also a good approximation to X4. Assuming that SpR] < 1, one can think 
of Sar as the Oth-order approximation to X1 = (Xpr — qt! in the series 
expansion 


Xz} = Spt (Iy — SpLR) = Dpt (Iy + wee): (12.24) 
The condition | rep R| < 1 is generally true when, e.g., the noise level is high, 
which is the case in electron tomography; hence, the PL is a good approxima- 
tion to the true likelihood. 


12.5 Reconstruction Algorithms 


As already mentioned in Section 12.4, we resort to MCMC types of algorithms 
if we are interested in finding global optima. In particular, for MAP estimators, 
simulated annealing [9, 29] are suitable for searching in a high-dimensional 
discrete space. For the MPM estimator, the modes can be estimated simply 
by sampling the posterior probability. In fact, both simulated annealing and 
sampling can be carried out by MCMC algorithms. 


12.5.1 Metropolis Algorithm for Finding the MAP and the MPM 
Estimators 


Since simulated annealing can be considered a sequence of sampling algo- 
rithms, each of which is at a fixed temperature that slowly decreases according 
to an annealing schedule [18], our optimization task becomes a sampling task 
that can be carried out, e.g., via the Metropolis algorithm [6, 38]. In each step 
of this algorithm, the ratio ^ (x^?) /z(z C9?) is calculated, where y is the target 
distribution and z? and x") are, respectively, the current state (image) and 
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a “tentative” image for the next state. Here a) and a) differ at only one 
randomly selected point. The next state a”) is equal to a) with probability 
min{1,7(2")/y(a™)}. A cycle of the Metropolis algorithm consists of |D| 
(the size of the image) such steps. Since typically a large number of cycles 
is required, it is highly desirable that the ratio can be computed efficiently. 
The distribution y in our case is proportional to [r(x)npr (wje) "7: T either 
changes according to an annealing schedule (for MAP estimators) or is fixed 
at 1 (for MPM estimators). For efficiency, we adopt a strategy very similar 
to that in [47], in which the value of T is absorbed into the computation of 
(pre-generated pseudo-) random numbers needed during the sampling. Thus, 
we only need to be concerned with the distribution y itself for both estima- 
tors. In translating [47] to our problem, we have to precalculate and store all 
the possible values of the log-ratio of the likelihood term 


(n^) 
e ea UAE AMA d (12.25) 
nprQw|x o?) 


12.5.2 Implementation of the Reconstruction Algorithms 


Following the discussions in Subsection 12.5.1, here we concentrate on the 
target distribution Ópr(r|w) c m(x)npr(w|x) for both the MAP and the 
MPM estimators. Using (12.23), the log ratio in (12.25) under the normality 
assumption is 


ue (wis?) OPLj (x) 
TC log — Y 12.26 
og NPL (w|z 9) p». og OPL; (a (^?) ( ) 
RS ——— — (risen = wi)” 
jii; FO 205,; (2) 20i (z^?) l 


where z"? and «™) differ only at the ith point and, from Subsection 12.4.2, 
Opp = 1} XT) Om, (12.27) 


In order for the the log ratio in (12.27) to be precalculated exactly and 
efficiently stored, we let the elements of r; (for 0 < j < J — 1) have only 
a few (one, in our experiments) possible nonzero values (see later in Subsec- 
tion 12.6.2). Specifically, suppose that there are exactly T; nonzero elements 
of r;, all having a same value. Then, since we are dealing with binary images, 
there can be only T; + 1 possible values for the expressions riya), TI^; 
and r5 X,r; in (12.27) and (12.27). Each value depends solely on the number 
of grid points (ranging from 0 to T;) that are labeled black or white along the 
line corresponding to the jth measurement. Therefore, given w; and Gas , we 
need only to precalculate and store Tj + 1 values for the jth measurement. 
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12.6 Experiments 


Given some (2D or 3D) phantoms that are label images, we generated their 
corresponding gray-value images (one gray-value image for each phantom) 
and simulated the projections taken from the gray-value images. Our task is 
to estimate the label images from the projections and to evaluate the recon- 
struction quality, which in this case is indicated by the average percentage of 
misclassified labels as well as (in the 3D case) by the area under a receiver 
operating characteristic (ROC) curve [40, 49]. 

We tested our estimators and compared them with the iterated condi- 
tional mode (ICM) approach [5] and a current method [using an algebraic 
reconstruction technique (ART) [19, 21]] of first reconstructing the gray-value 
image and then segmenting it to obtain the label image. ART has proven 
to outperform several algorithms for various tasks [22, 28, 35, 44]. We also 
investigated the approach discussed in Subsection 12.2.3 in which we assume 
a perfect reconstruction of the gray-value image and then segment it using 
an optimal threshold. Both 2D and 3D reconstructions were studied, and a 
methodology for objective comparison of algorithms [17, 26] was applied. 


12.6.1 The General Setting 


In all the experiments we took binary (two-label) images: We used black (or 0) 
to represent ice and white (or 1) to represent protein. The product structure 
of ó(y|x) implies that for a label image, a gray-value image can be obtained by 
sampling at each pixel according to the probability y(y;|x;), for 0 <i < 1— 1. 
Normality is assumed in all the experiments. The y(y;|x;), for 0O <i € I — 1, 
is taken to be normally distributed with mean pz, and variance o?,, where 
pe: = ae Ho = 4, and uı = 9. However, for the projection data simulation, 
to avoid negative densities (gray values), the value of y at a pixel was set to 
zero whenever it was negative according to v(y;|v;); this happens only with 
probability less than 0.028 if z; — 0 and less than 0.01 if z; — 1. There is no 
particular reason for choosing these values of mean and variance, except for 
the fact that they reflect the idea of overlapping histograms in Fig. 12.1. 

The significance of the differences in reconstruction quality (as measured 
by the average percentage of misclassified labels) between two estimators is 
measured using the t-test, which produces the P-value [39]. A low P-value 
(usually less than 0.05) indicates that the null hypothesis that the two esti- 
mators have the same performance can be rejected in favor of the alternative 
hypothesis, which states that the respective qualities do differ. 


12.6.2 2D Experimental Details 


To assess the usefulness of Gibbs priors, we first took phantoms of size I = 
63 x 63, which are true random samples from a GD, namely, the one that 
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Fig. 12.4. Illustration on a 5 x 5 image of the chosen projection lines, so that the 
length of intersection of a line with a pixel is the same in the direction with tangent 
equal to (respectively from left to right) infinity, 1, 0.5, and —0.5. Ninety-degree 
rotations of these directions give the directions with tangents equal to, respectively, 
0, —1, —2, and 2. 


generated the leftmost image of Fig. 12.3. We then also considered label images 
that represent the cross section of a macromolecule. 

'The projections forming the measurement vector w were simulated as fol- 
lows. There were either four or eight projections using parallel lines in each 
projection, such that the tangent of the angle, denoted by a, between these 
lines and the “positive horizontal direction” is 0, infinity, —1, and 1 in the 
case of four projections and it is 0, infinity, —1, 1, —0.5, 0.5, —2, and 2 in the 
case of eight projections. We chose parallel lines so that the nonzero lengths 
of intersections of a line with the pixels are all the same in one direction; see 
Fig. 12.4. 

'This choice is not a necessary condition for validity of the our approaches, 
but it simplifies their implementation. For each experiment we formed a J- 
dimensional column vector z = (21,..., z7)! from the sums of the pixel values 
in the gray-value image along the J lines in all the projections. 

We chose v;(w;|y), for 0 € j € J — 1, to be normally distributed with 
mean zj, and we initially intended to set the variance to be On, = N-z; 
We call N the noise level, whose values were chosen to be 0.25,1.0, or 4.0. 
Since it is possible that z; = 0, which would not produce a positive definite 
(diagonal) covariance matrix Xw in (12.12), we replace o?,; by N - à; = 
N - max (uo, wj), where wj is sampled from v;(w;|y). The preceding is how 
we modeled q;(w;|y) in the algorithms. For the simulated noisy projections, 
to avoid negative w; we set wj = Wj, for j = 0,...J — 1. For a justification, 
see [32]. 

For the P-MAP estimators, we ran 5-104 cycles of the Metropolis algorithm 
for each temperature T', where 1/T ranged from 0.5 to 1.4 in increments of 
0.05. We did not observe significant improvements by increasing the range 
(from 0.5 to up to 2) of 1/T. For the P-MPM estimator, we took a total 
of 1000 samples for computing the mode, and each sample was obtained by 
running 2 - 104 cycles (which is sufficient for the burn-in). A mode based on 
1000 samples was practically the same as that based on 5000 samples. 
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Table 12.1. Percentage of Misclassification in the PL Approximation (N is the 
Noise Level) 


N-035 
P-MAP with 4 projections | 7.0 + 4.0 |9.4 + 5.1|14.9 + 6.5 
P-MPM with 4 projections] 5.4 + 2.2 |7.2 + 3.3|11.9 + 4.3 














P-MAP with 8 projections | 3.4 + 1.4 4.8 + 1.8] 9.8 + 3.5 
P-MPM with 8 projections] 2.8 + 1.0 |4.0 + 1.4| 7.7 + 2.7 

















12.6.3 2D Experimental Results 


Table 12.1 reports on the quality of the two estimators for the three noise 
levels. It demonstrates the superiority of the P-MPM estimator to the P- 
MAP estimator with P-values (see Subsection 12.6.1) that are less than 1074 
for all the noise levels. Figure 12.5 shows some actual reconstructions. 





Fig. 12.5. Reconstructions of label images from a true Gibbs distribution. The 
top image is a phantom. From left to right in the second row are its P-MAP and 
P-MPM estimates from four projections, and the P-MAP and P-MPM estimators 
from eight projections; all the reconstructions correspond to the noise level N = 0.25, 
and the number of misclassifications are 600, 391, 182, and 186, respectively. The 
bottom row, in the same arrangement, corresponds to N = 4.0, with the number of 
misclassification being 893, 719, 482, and 480. 
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Table 12.2. Percentage of Misclassification (N is the Noise Level) 


Method projections, N — 10 
Optimal thresholded gray-value image 6.8 + 0.2 
Optimal thresholded gray-value reconstruction 7.7 35 0.2 


9501 


P-MAP 4.1 3 0.3 


P-MPM 3.5 + 0.2 








We also reconstructed binary images that represent a cross section of a 
macromolecule in four conformations, based on eight projections and with 
noise level N — 1.0. Our approaches require the knowledge of the GD from 
which the unknown image is a typical sample. To that end, for each of the 
four images, we estimated the Gibbs prior parameters using the other three 
images. 


Fig. 12.6. A cross section of a macromolecule in four conformations. 


'The four conformations are depicted in Fig. 12.6, Table 12.2 reports on 
the quality of the estimators, and Fig. 12.7 shows actual reconstructions of 
one phantom. We note that in terms of misclassification, current techniques 
perform worse than our methods. However, our P-MPM estimator, which 
has the fewest misclassification, does not fully recover all the features in the 
original image. Therefore, a different quality indicator ought to be used, such 
as the area under a ROC curve (see our experiments with 3D images). We also 
note that, in general, P-MAP estimates are better at recovering small features, 
but the reconstructed structures are more blocky, a property described as 
“oversmoothing” in [15]. 


12.6.4 3D Experimental Details 


In the single-axis data-collection scheme [16, 30] in electron microscopy, a 
main advantage of a fully 3D reconstruction using 3D cliques (as opposed to 
using 2D cliques and doing a slice-by-slice reconstruction) is the smoothness 
imposed across neighboring slices. In our discussion ahead, we will, in fact, 
deal with a set of viewing directions adjusted to the grid points. 
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Fig. 12.7. Reconstructions of an image representing the cross section of a macro- 
molecule. From left to right, in the top row are one phantom and its gray-value 
image. In the central row are an optimal thresholding of the gray-value image (cor- 
responding to an “ideal” reconstruction by current approaches), the reconstruction 
by the an ART-with-pixel current approach, and that by the ICM method. The 
number of misclassification are respectively 272, 319, and 387. In the bottom row 
are the reconstructions using the estimators P-MAP and P-MPM, whose number of 
misclassification are, respectively, 147 and 130. 


For 3D label images, we define the domain D as a subset of the face- 
centered cubic (FCC) grid F 


D = (v = (v1, v2, v3) |v E€ Fand0 < v; < V; for p = 1,2,3}, (12.28) 


where 


3 
F- f- (v1, v2, v3) [v €Z", XD = edt. (12.29) 
p=1 
and the V, are positive integers. Apart from a smoother graphic display of 
a surface [23] when using a FCC grid, its grid points are more “evenly” dis- 
tributed, and each has fewer (only 12) face-or-edge neighboring grid points 
than a cubic grid point (that has 18 face-or-edge neighbors). This implies 
less computational burden in the sampling of Gibbs distributions. Here a grid 
point and its 12 neighbors constitute a clique. The local features (i.e., configu- 
rations on a clique) that we are interested in are black regions, white regions, 
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Fig. 12.8. Cross sections (the 30th slice) of typical images corresponding to 10 of 
our 3D GD models. The first nine cross sections are normal to the direction 3, and 
the last one is normal to the direction 1. 


Cartesian and regular walls, small and large convex corners, and small and 
large concave corners; see [34] for details. Figure 12.8 shows the cross sections 
of typical 3D sample images from 10 different GDs, and surface renderings of 
4 of the 10 samples are depicted in Fig. 12.9. 





Fig. 12.9. Surface rendering of the samples 1, 3, 4, and 9 of Fig. 12.8 (from left to 
right and top to bottom). 


Phantoms 


We considered 3D binary images with domain size such that Vi = V2 = 64 
and V3 = 42 (so that |D| = 86,016). Ten phantoms represented biological 
macromolecules, each of which is a discretization of nine spheres. Near the 
surface of the top and bottom spheres are four possible locations for a little 
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Fig. 12.10. Surface renderings of a phantom and one of its cross sections. 


sphere that can be either present or absent with probability 0.5 (so the total 
number of distinct phantoms is 2°%4, but we randomly selected 10 of them 
for the experiments). Figure 12.10 shows the case when all the little spheres 
are present. The discretization was done by labeling white a grid point (in 
D) if it is inside or on a sphere, and it is labeled black otherwise. The pur- 
pose of introducing the little spheres is to evaluate the detectability of small 
structures. 

For the estimation of the parameters of the GD, we used the 3D model de- 
scribed above and randomly took 100 (different from the previous 10) training 
phantoms. The number of training images turned out to be more than suffi- 
cient, since the estimates do not differ significantly from those based on 1000 
or even on 10 training images. 


Projection Data 


In each projection we took parallel lines that pass through all the grid points 
in the domain D, and the lengths of intersection with a rhombic-dodecahedron 
voxel have all the same value. Nine projections with such a property are in 
the direction (0, 1, 1), (0, —1, 1), (1,0, 1), (—1,0,1), (1,1, 0), (—1, 1,0), (1,0, 0), 
(0, 1,0), and (0,0, 1). Neither the condition of passing through all the grid 
points, nor the condition that the intersection length must be equal in one pro- 
jection, is necessary for the validity of the our approaches, but together they 
simplify the implementation. For the jth line (j = 0,...,J — 1), we computed 
the line integral z; of the gray values inside the mathematically described im- 
ages as follows. Let lọ and lı be, respectively, the lengths of intersection of the 
line with the spheres and with the background. Then z; was assumed to come 
from a normal distribution with mean and variance equal to polo + ji l1, where 
Ho = 4 and py = 9. To avoid negativity, we set zj to 0 whenever it was sam- 
pled negative (with probability less than 0.028 in only a few measurements). 
Our v;(w;|y) in the algorithms was considered to be normally distributed 
with mean z;, and we initially intended to set the variance to be c2, j7 Na. 
To avoid nonpositive variance, we replaced 0j by N -w; = N-max(p0, wj), 
where wj is sampled from v;(w;|y). Finally, the simulated measurement was 
taken to be w;, similarly to what we did with 2D images. 
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Reconstruction Methods 


Our proposed approaches were compared with conventional approaches of first 
reconstructing the gray-value image followed by a segmentation. We used ART 
with blobs [22, 35] for the gray-value reconstruction, followed by an optimal 
thresholding. The sequence of projections in ART is the same as the one given 
in the previous subsubsection, and the sequence of lines in one projection was 
such that no blob intersects two consecutive lines. The parameters for the 
blobs were obtained based on the equivalent-grids criterion of [36] (which 
determined the grid sampling distance) and the (first and fifth) zero-crossing 
criterion of [20] (which gave the radius and the rate of decay of the blobs); 
see [34] for details. The thresholding level was chosen to yield the “correct” 
relative concentration of the black and white region. Such a concentration was 
estimated from the training phantoms. 

In our methods, we found that we can run fewer cycles for the samplings 
than in the 2D case. In particular, we ran 3000 cycles to obtain one sample (in 
total, 500 samples were taken for one P-MPM estimate). We observed that 
typical energy does not significantly change even if we increased the number 
of cycles to 5 - 10°. We ran 104 cycles for each temperature of the simulated 
annealing for one P-MAP estimate. The annealing schedule was such that 1/T' 
ranged from 0.1 to 1.25 with intervals of 0.05. The number of cycles and the 
annealing schedule were not optimized. 


Evaluation 


The average percentage of misclassification, which will be denoted by 6, can be 
regarded to as a figure of merit (FOM) in the context of statistical-hypothesis- 
testing-based methodology [17, 26] for the evaluation of the relative efficacy 
of two reconstruction methods. A FOM measures how helpful a reconstructed 
image is for solving a specific problem (or achieving a task) in the application 
area. (Strictly speaking, the FOM in this case should be 100 — ó instead.) 

Here we are interested not only in the “appearance” of the reconstructions 
but in the detectability of small structures as well. To that end, for each 
reconstruction we use two measures: One is 6 and the other is 100 times the 
area under a ROC curve. For details on the latter applied to our 3D phantoms, 
see [32]. 


12.6.5 3D Experimental Results 


In Table 12.3, we report on the quality of the reconstructions using nine 
projections by the current ART-based approach, the P-MPM estimator, and 
the P-MAP estimator. Figure 12.11 shows the actual reconstructions of one 
phantom in one cross section. As far as the measure ó is concerned, in the case 
N = 0.01, the P-MPM estimator is significantly (P-value less than 0.001; see 
Subsection 12.6.1) the best, followed by the P-MAP estimator, and then the 
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Table 12.3. Quality of Reconstruction Using Nine Projections, According to 6 
(percentage of misclassification averaged over the ten testing phantoms; left side) 
and the Detectability Measure (100x area under a ROC curve; right side) (N is the 
noise level) 


Method] N=001 | N=10 | N=40 [N - 091|N - 19[N — 19) 
DE: 
3 
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ART estimator. For N = 1.0, the P-MPM estimator is still significantly the 
best, while the comparison between the other two estimators is not significant. 
For N = 4.0, the ART estimator is significantly the best, followed by the P- 
MPM estimator, and then the P-MAP estimator. 

According to the detectability measure (by the area under a ROC curve), 
for N = 0.01, the differences among the three estimators are not significant, 
and neither are the differences between the P-MPM and the P-MAP estima- 
tors at the three noise levels. For N = 1.0 and N = 4.0, the ART estimator 
is significantly the best. 


12.7 Conclusions and Discussions 


We have created 2D and 3D Gibbs prior models that allow us to directly 
recover the label image from the projection data. We have shown that an 
MPM type of estimator performs better than an MAP type of estimator. 

Our experiments on 2D images indicate that our proposed techniques out- 
perform an ART-with-pixel-based current approach and even an “ideal” cur- 
rent approach (in which the gray-value image is perfectly reconstructed), in 
terms of misclassification. The P-MPM estimator proved to be the best in 
general, under that measure. 

The results of our experiments on 3D images indicate that although the 
P-MPM estimator is superior to an ART-with-blob-based current approach 
in terms of misclassifications (except at the higher noise level), it is not so 
under a detectability measure. 

The main differences between the two sets of experiments are (i) ART 
with pixel basis was used in 2D and ART with blob basis in 3D, and (ii) 
the nature of the Gibbs prior. Our 3D reconstructions may be improved by 
“enhancing” our Gibbs prior model. For example, the cliques of our 3D models 
are relatively smaller (when looking at a cross section) than those of our 2D 
models, and (as mentioned in [10, 33]) the reconstruction quality using small 
cliques can be poor compared to that when using larger ones. 

In this work, for estimating the Gibbs distribution from which typical 
images of an application area are assumed to be samples, we first created a 
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Fig. 12.11. Reconstructions of a 3D phantom (one cross-section). The top image 
is the phantom. From left to right in the second row are its reconstruction using 
ART, an optimal segmentation of the ART reconstruction, the P-MPM and P-MAP 
estimators; all the reconstructions correspond to the noise level N = 0.01 and the 
number misclassification (in the 3D object) are 1231, 888, and 1083. The bottom 
row, with the same arrangement, corresponds to N = 4.0 with 1922, 2069, and 2339 
misclassifications. The size of a phantom is 86,016. 


model and then estimated the parameters of the Gibbs prior for that model 
from the images. An interesting open problem is to determine also the model 
from the images; this means choosing an appropriate set of cliques and local 
features. 

A disadvantage of the MAP estimator is oversmoothing. A weakness of the 
MPM criterion is the lack of distinction between “scattered” and “aggregated” 
misclassifications: A modest number of misclassifications are rather harmless if 
they are scattered, as they would be interpreted as isolated misclassifications, 
which is not the case if they aggregate into some artifact. Therefore, it is 
important to develop and study new estimators that would not present such 
undesirable properties [43]. 
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Discrete Tomography for Generating Grain 
Maps of Polycrystals 


A. Alpers, L. Rodek, H.F. Poulsen, E. Knudsen, and G.T. Herman 


Summary. The determination of crystalline structures is a demanding and funda- 
mental task of crystallography. Most crystalline materials, natural or artificial, are in 
fact polycrystals, composed of tiny crystals called grains. Every grain has an associ- 
ated average orientation that determines the spatial configuration of the crystalline 
lattice. Typically, the structure of a polycrystal is rendered via an orientation map 
or a grain map, in which individual pixels/voxels are assigned a grain orientation 
or a grain label. We present two related approaches to reconstructing a 2D grain 
map of a polycrystal from X-ray diffraction patterns. The first technique makes the 
assumption that each grain is actually a perfect crystal, i.e., that the specimen is not 
deformed. The other method can be applied also when the sample has been exposed 
to moderate levels of deformation. In both cases, the grain map is produced by a 
Bayesian discrete tomographic algorithm that uses Gibbs priors. The optimization 
of the objective function is accomplished via the Metropolis algorithm. The efficacy 
of the techniques is demonstrated by simulation experiments. 


13.1 Introduction 


In nature most materials such as rocks, ice, sand, and soil appear as aggregates 
comprised of a set of small crystals. Similarly, modern society is built on 
applications of metals, ceramics and other “hard materials," which are also 
polycrystalline. 

An example of an undeformed polycrystal is shown in Fig 13.1. The indi- 
vidual crystals are known as grains. Each grain is characterized by its position 
and shape as well as by the orientation of the 3D crystalline lattice (the dis- 
crete lattice of atom positions). The latter property is known as the grain 
orientation. The physical, chemical, and mechanical properties of the ma- 
terial are to a large extent governed by the geometrical features of this 3D 
complex, including neighboring effects (such as the correlation between the 
orientation of two neighboring grains and the morphology of the boundary 
separating them). Remarkably, until recently, no nondestructive methods ex- 
isted for visualizing the grains in 3D. Studies of grains could be performed 


272 A. Alpers, L. Rodek, H.F. Poulsen, E. Knudsen, G.T. Herman 





Fig. 13.1. Grain map of an undeformed aluminum polycrystal. The orientation of 
the crystalline lattice (represented by a gray scale) within each grain is constant. 


only by sectioning the sample and characterizing the surface by, e.g., optical 
or electron microscopy. This way does not allow us to follow the evolution of 
the 3D complex as function of time, e.g., to visualize the growth or shrink- 
age of a grain when the sample is heated. As a consequence of this lack of 
experimental studies, existing models of basic industrial processes, such as 
deformation and annealing, are grossly simplified and typically only deal with 
average properties of the grains. 

Three-dimensional X-ray diffraction (3DXRD) is a novel experimental 
method for characterization of polycrystalline materials [24]. Its primary goal 
is to provide 3D grain maps (3D equivalents of Fig. 13.1) and even 3D movies 
by repeated in situ acquisitions of 3D grain maps during the relevant treat- 
ment of the sample. The method is based on reconstruction using X-rays with 
a set-up similar to that of conventional transmission tomography. The vital 
difference is that in computerized tomography the absorption of the incident 
beam through the sample is probed, while in 3DXRD the diffracted beam is 
probed as it diverges from the sample on the exit side. The diffraction pattern 
on the detector typically is composed of a set of distinct diffraction spots. Ac- 
quiring images at a set of rotation angles, each grain gives rise to 225—30 spots, 
with positions and intensity distributions determined by the local orientation 
of the crystalline lattice. A sketch of the data collection methodology is shown 
in Fig. 13.2. 

Note that the data acquisition is similar to what is usual in discrete to- 
mography (DT), with the important difference that the diffraction spots do 
not correspond to measured sums along straight lines, but rather correspond 
to measured sums taken over more general sets. Our approach differs in this 
sense from approaches formerly known in the DT literature. 

The first 3DXRD grain maps were reconstructed in 2003 by applying a 
constrained variant of the algebraic reconstruction technique (ART) [25]. From 
these and other studies [20, 23], it has become clear that this approach has 
severe limitations: 


(a) The grain map is generated by patching together reconstructions for in- 
dividual grains. Each of these reconstructions is originally represented as 
a real-valued function, and subsequently binarized by using a threshold. 
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Fig. 13.2. Sketch of the 3DXRD geometry. Detectors are positioned perpendicular 
to the beam at distances L1, L2, and Ls. 


'The resulting grain maps often show unphysical ambiguous areas near the 
grain boundaries, where some parts may be associated with more than one 
grain and some with none. 

(b) For each grain, the nature of the diffraction process implies that there is 
only a finite set of possible projections. The exact number depends on the 
experiment, but it may be as low as 5. This sparsity of data interferes 
with the aim of obtaining high-quality grain maps. 

(c) It is of major interest to extend the concept of 3D mapping to deformed 
materials. While the ART formalism can be generalized to this case [23], 
the demands for computer power have so far prohibited such solutions. 


Fortunately, the grain structure exhibits additional properties, which may 
be exploited in the reconstruction: 


(a) The grains can be approximated by discrete objects in the sense that a 
given voxel in the sample belongs either fully to a given grain or not at 
all. 

(b) The grains are simply-connected 3D space-filling objects within the bor- 
ders given by the sample geometry. 

(c) The grain boundaries are smooth. 

(d) The physics underlying the generation of the structure implies that grain 
maps tend to resemble each other, once they have been scaled to the aver- 
age grain size. Hence, it is possible to describe local properties statistically. 


These arguments suggest that tools from discrete tomography [15] should 
be well suited for the reconstruction of grain maps. In this work, we investigate 
the prospect of using Bayesian discrete tomographic algorithms that employ 
Gibbs priors. Our exposition is based on the papers [1, 2, 26]. 

'The outline of this chapter is as follows. We define the concepts of orien- 
tation maps and grain maps in Section 13.2. Unit quaternions are shown to 
be appropriate for representing orientations. In Section 13.3, we present our 
approach for undeformed specimens. The quality of the reconstructed maps is 
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demonstrated and quantified by simulation studies. Similarly, in Section 13.4, 
the formalism and simulation results for the moderately deformed case are 
given. Section 13.5 concludes with a discussion and outlook. 


13.2 Orientations and Grains 


In a crystalline material, the atoms are arranged in a three-dimensional dis- 
crete lattice. The grain can be fully characterized by two properties: 


(a) The lattice, L = {xa + yb + zc|z,y, z € Z}, where a,b,c € R? are given 
vectors not all lying in the same plane and Z is the set of all integers. 
Notably, for a given lattice, a number of symmetry operations may exist 
(such as inversions, rotations, and mirror operations) that map the lattice 
onto itself. Lattices that are invariant under such operations are said to 
exhibit crystal symmetries. 

(b) The orientation of the lattice. Notably, due to crystal symmetries, distinct 
rotations of the lattice may result in the same orientation. 


Most crystalline materials are polycrystals. In case of an undeformed spec- 
imen, the lattice within each grain is near-perfect. As such, each grain can 
be associated with exactly one orientation. Let a 2D section of the specimen 
of interest, D, be discretized into a set of pixels. We define a grain map f 
as a function mapping the finite domain D C Z? into {1,...,1}, where the 
integer | is the number of grains in the map. Clearly, if i and j are pixels in 
the same grain, then f(i) = f (j). We further define an orientation map o as a 
function mapping D into the set of unit quaternions (see Subsection 13.2.1). 
For undeformed specimens, o has to fulfill f(i) — f(j) — o(i) — o(j) for all 
pixels i and j within the same grain, whereas this need not be true in the 
deformed case. 

If an initially undeformed specimen is deformed, the boundaries between 
grains get distorted. At the same time, the lattice is no longer homogeneous 
within each grain, but there is a spatial variation of orientation. With in- 
creasing amount of deformation, these distortions become larger and larger. 
In the following we confine ourselves to moderate degrees of deformation, 
which refers to the case in which it is possible to derive the corresponding 
grain map from a given orientation map. 

Algorithms producing grain maps from given orientation maps are stan- 
dard in the analysis of electron microscopy data [4]. They are based on the 
class of image analysis tools known as connected component or labeling rou- 
tines. These techniques classify the pixels of an image into so-called regions 
by assigning a unique region label to every pixel. 

The aim of this chapter is to establish algorithms that—given 3DXRD 
data—reconstruct both grain maps and orientation maps. The simultaneous 
reconstruction has several advantages, such as speed of computation and the 
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fact that DT tools can be applied. Additionally, many applications demand 
an accurate determination of both orientation maps and grain maps. 

In the following, we will assume throughout that the sample is a monophase 
polycrystal of a known material, implying that all grains have the same a priori 
known crystal structure. 


13.2.1 Representation of Orientations by Unit Quaternions 


Different kinds of formalisms exist to describe orientations. A common ap- 
proach is to describe an orientation by a rotation in 3D space—the rotation 
that has to be performed to obtain the actual arrangement of grain atoms, 
starting from a fixed reference system. In this way, orientations and rotations 
seem to be synonyms of the same idea. The issue of crystalline orientations is, 
however, more complicated: The same orientation can sometimes be achieved 
by two or more quite different rotations, due to the crystal symmetries. 

It is critical to choose an appropriate representation for orientations. For 
example, it is important that every orientation of interest can indeed be rep- 
resented. Furthermore, the scheme should provide a simple and useful way 
of visualization; it should include a notion of “distance” or “dissimilarity” 
d(01, 02) between a pair (01,02) of orientations, and it should allow easy /fast 
computations, for example, for the “composition” of two orientations. 

Representation by unit quaternions has several well-known advantages, 
such as that there is a metric based on the norm of the 4D vectors of quaternion 
components (see Subsection 13.2.2) and there is a continuous correspondence 
between unit quaternions and points on a 4D sphere of unit radius. In the 
following, we summarize the most important properties of unit quaternions, 
mainly focusing on how they represent 3D rotations. Before proceeding with 
the exposition, however, a few notes are in order concerning different aspects 
of orientations. 

It is a well-known fact that any rigid transformation of 3D space into 
itself can be described by a translation of the origin followed by a rotation 
around an axis through the translated origin. Since from the point of view of 
orientations, the translation part of this process is irrelevant, every orientation 
is describable by a unit vector n (specifying the axis of rotation) and an angle 0 
(specifying the amount of counterclockwise rotation around the directed axis). 
However, such a description (n, 0) of orientations is not unique: (—n, —0) or 
(n, 0 + 27) will result in the same orientation. 

What is the “distance” between the orientations defined by (n1,61) and 
(n5, 05)? Heuristically, we may argue that it should be (a monotonic function 
of) the smallest positive angle 03 such that the rotation (n,01) followed by 
the rotation (n3,63), for some unit vector ns, will result in the orientation 
defined by (n5,05). At this stage, it is far from obvious how such a distance 
should be computed. Quaternions provide us with a natural way of achieving 
that, as discussed in Subsection 13.2.2. 
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There is an inherent difficulty with the definition of crystalline orienta- 
tion: Even with the knowledge of the location of the crystal lattice points in 
space (relative to some fixed coordinate system), the average orientation of 
the grain (in the sense of how it had to be rotated relative to some reference 
position in order to get into its current position) is not defined in an obvi- 
ous and unambiguous manner. Consider, for example, the simple cubic lattice 
whose lattice points are at locations (x, y, z) with x, y, and z integers. Suppose 
further that we rotate the lattice by sn radians counterclockwise about the 
positive z-axis. The resulting arrangement of lattice points is indistinguishable 
from what would be obtained by rotating the lattice by 7 radians counter- 
clockwise (or, for that matter, clockwise) about the positive z-axis. Thus, due 
to the lattice structure (more specifically, the rotational crystal symmetries), 
the same observable arrangement of lattice points can be obtained by very 
different rotations; which of these should then be selected for the definition 
of “orientation”? We return to this important point in Subsection 13.2.2, but 
we will ignore its consequences in the discussion until then. 

Quaternions and their applications to rotations and orientations have a 
well-developed theory [3, 8, 9, 18, 22]. In this subsection, we give only a 
skeleton development, restricting our attentions to those definitions and facts 
that we absolutely need for our purpose. 

A quaternion q is a 4-tuple (a, b, c,d) of real numbers. The product of two 
quaternions is defined by 


(a1, b1, c1, d1) (a2, b2, c2, d2) = (a3, b3, c3, da) , (13.1) 
where 


a3 = à102 — b303 — cyc2 — did», 





b3 = a4105 + baz + cid — d4c5, (13.2) 





c3 = @1C2 — bıd2 + c1a2 + dibs, 

d3 = a1d» + bycg — c109 + dy a2. 
It is easy to check that multiplication is associative, and so one can use un- 
ambiguously the notation pqr for the product of the three quaternions p, q, 


and r. Note also that (1,0,0,0) is an identity element; i.e., for any quaternion 
q, we have 


(1,0,0,0)q = q =q(1,0,0,0). (13.3) 
The norm of the quaternion (a, b,c, d) is defined by 


(a,b,c, d| = Va 3-2 3-2 4 a. (13.4) 


We now specify two useful subsets of the set of quaternions. For both 
of these, it is helpful to introduce the notation (a,b), where b is the 3D 
vector (b, c, d), to abbreviate the quaternion (a,b, c, d). The conjugate of the 
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quaternion q = (a,b) is defined to be q = (a, —b). Note that the conjugate 
of the product of two quaternions is the reversed product of their conjugates; 
i.e., q1d2 = G2q1. 

The set of pure quaternions consists of quaternions of the form (0,b). Note 
that there is an obvious 1-to-1 mapping of the 3D Euclidean space onto the 
set of pure quaternions. 

The set of unit quaternions consists of quaternions whose norm is 1. It is 
easy to check that the product of two unit quaternions is a unit quaternion, 
and that the conjugate of a unit quaternion is also a unit quaternion, and it 
is, in fact, the case that, for every unit quaternion q, qq = (1,0,0,0) = qq. 
Another useful property is that if q3 = Giqeqi, where qı is a unit quaternion 
and q2 is any quaternion, then the first component of qg is the same as that 
of q2. 

Every unit quaternion q can be written as 


q = (a,b) = (cos (30) ‚n sin (48), (13.5) 


for some real number 0 and 3D unit vector n. We are now going to discuss 
that this unit quaternion can be used to achieve a rotation in 3D about the 
directed axis n by the counterclockwise angle 0. Let b be a 3D vector. Then 
q(0, b)q will be a pure quaternion (0, b^), where b’ is the vector obtained from 
b by the rotation about the directed axis n by the counterclockwise angle 0. 

As an example, consider the rotation by 3x radians counterclockwise about 
the positive z-axis. According to the discussion above, this is represented by 


the quaternion 


see (13.5). It is easy to check, using (13.1) and (13.2), that for this q 


q(0,0,1,0)q — (0.— 35. 25.0) , (13.7) 
corresponding to the fact that the specified rotation maps the 3D vector 


(0, 1,0) into (75 4.0). 

Note that the unit quaternion q does not determine (n,0) uniquely, but 
all the (n,0) that satisfy (13.5) describe the same transformation of the 3D 
space onto itself, since any of them can be obtained from any other by steps 
consisting of changing 0 by 47 or simultaneously taking the negative of both 
n and 0. The transformation of the 3D space also does not change by adding 
27 to 0, but this changes the quaternion q of (13.5) into —q. Thus, the unit 
quaternions q and —q define the same rotational transformation of the 3D 
space into itself; there is a 2-to-1 mapping of the set of unit quaternions onto 
the set of such rotational transformations. 

A particularly useful and elegant consequence of this approach is that 
the rotation represented by the unit quaternion qı followed by the rotation 


278 A. Alpers, L. Rodek, H.F. Poulsen, E. Knudsen, G.T. Herman 


represented by the unit quaternion q2 is the composite rotation represented 
by q2qi, since for any quaternion p we have 


q»(dipq;)a» = (q2q1)P(4142) = (a2a1)p(a241) - (13.8) 


13.2.2 Distance in Orientation Space 


We now continue with the central issue of how to define the distance d(oi, 02) 
between the orientations o1 and o». Initially, we ignore the underlying crys- 
talline lattice and consider only the distance, denoted by r, between rotations. 

According to the heuristic stated in Subsection 13.2.1, such a distance 
between two rotations should be a monotonic function of the smallest non- 
negative angle 05 such that the first rotation followed by a third rotation 
(na, 03), for some unit vector ns, will result in the second rotation. If the first 
rotation is represented by the unit quaternion q;, and the second by the unit 
quaternion qs, then a unit quaternion qa that corresponds to the desired third 
rotation is q2q1, since 


qsqdi = (q2q1)qı = q»(qd141) = q2(1,0,0,0) = qo. (13.9) 


Since q3 and —qg define the same rotation, the first component a3 is nonneg- 
ative for at least one of them, and the corresponding smallest nonnegative 03 
must lie between 0 and z. This is the value that indicates the dissimilarity, 
and so r(q1,q2) := 1 — |a3| can be used as the definition of distance. Looking 
at the first row of (13.2) for the product qoqi, with qi = (a1, b1, c1, dı) and 
q2 = (a2, b2, c9, d2), we see that 





a3 = a1a2 + b1b5 + C109 + didə. (13.10) 


It is clear that a3 is nothing but the dot product of qı and q2 when they 
are considered as 4D unit vectors. In particular, |as|, and hence r(q1,q2), 
does not depend on the choice of the unit quaternions used to represent the 
first two rotations: If one used the alternative representations —q; and —q2 
for one or both of these rotations, then the value of |a3| would remain the 
same. It can be verified that r(q1,q2) = 0 if, and only if, qı = +q2, and that 
r(qi, q2) = r(q2, q1) (in other words, we have the desirable property that the 
distance from qı to q» is the same as the distance from q2 to qi). Also, it is 
the case that r(qi,q2) = r(d1, G2) = r((1,0,0,0), q2q1). 

Now we return to the issue raised in Subsection 13.2.1, namely that ori- 
entations are associated with a crystal structure, and hence the rotational 
crystal symmetries have to be taken into account. To illustrate this problem, 
let q denote a fixed unit quaternion describing an arbitrary rotation, and s 
be a unit quaternion describing a symmetry rotation of the crystalline lattice 
(i.e., a rotation that maps every lattice point onto a lattice point). It is easy 
to see that the rotation qs results in the same orientation of the lattice as the 
rotation q. Thus, q and qs are said to be crystallographically equivalent. 





13 Discrete Tomography for Generating Grain Maps of Polycrystals 279 


As an example, let us consider the three cubic crystal lattices: the simple 
cubic, the face-centered cubic, and the body-centered cubic lattice. These lat- 
tices have the same crystal symmetries: There are 48 symmetry operations, 24 
of which are pure rotations. So, for every rotation q, there are 23 other rota- 
tions that are crystallographically equivalent with it. Many important metals 
have a cubic lattice. For instance, iron has a body-centered cubic lattice, while 
gold, silver, aluminum, copper, and lead all have a face-centered one. Indeed, 
the study of relative orientations in the presence of crystal symmetries has 
been an active field of research in the analysis of the texture of materials for 
over 30 years [12, 13, 14, 17, 22]. 

We are now ready to define the distance d(q1, q2) between two orientations 
of the lattice brought about by rotations represented by unit quaternions qi 
and q2. If the quaternions qı and q» correspond to the orientations of two 
neighboring grains, or of two neighboring pixels in the orientation map, then 
d(qi,q2) should measure the size of the disorientation between the grains/ 
pixels, taking into consideration the underlying crystal symmetries. Following 
the methodology presented in [12, 14], this can be done as follows: Let S denote 
the set of all symmetry rotations of the lattice. (Since inversions, translations, 
and reflections are irrelevant from the point of view of equivalent orientations, 
it is enough to restrict S to rotational operations.) Then, by definition, 


d(qi, q2) — min r(qisi, q282) , (13.11) 


S1,82€ S 


i.e., the smallest distance between a quaternion that is crystallographically 
equivalent to q; and another quaternion that is crystallographically equivalent 
to q2. 

Sometimes it is intuitively informative to refer to the distance as if it were 
measured in degrees. This can be done by associating a disorientation angle 
with every distance value. The following is the precise definition: We say that 
distance d (recall that 0 < d < 1) is equivalent to the disorientation Ó if, 
and only if, 0 < 0 < 180 and cos (36) = ] — d. Note that there is one, and 
only one, such 0 for any d such that 0 € d < 1. That this is a reasonable 
convention follows from the definition of r(q1, q2). In the material presented 
below, whenever we specify that a disorientation is 0, then we mean by this 
the corresponding distance d — 1 — cos (16). 

We are now going to show that, in fact, 


d(qi,q2) = min r((1,0,0, 0), sq2ā1) i (13.12) 


We first recall that r(qisi,q2s2) = r((1,0,0,0),S2G2qis1). To evaluate the 
right-hand side, we need the first component of S2Q2q1S81 = 518152q2q1S1. 
Recalling a previously stated property of this product, we see that its first 
component is the same as that of s1S2q2qi. Since the conjugate of a crystal 
symmetry is a crystal symmetry and the product of two crystal symmetries 
is also a crystal symmetry, the claim (13.12) follows. 
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Though it would be possible to determine the value of r in (13.12) for 
each s one by one, and then to choose the smallest one, this would really slow 
down the reconstruction procedure. (For example, S has 24 elements for cubic 
lattices.) A more efficient method is to use a look-up table of the values of 
min r((1,0,0,0), sq), based on quantization of the unit quaternion q. We now 
s 


show that there is a fairly easy way to implement such a look-up table. 

First, it should be recalled that the components of a unit quaternion have 
magnitudes less than or equal to 1. For any rotation, the first component a 
of q can be chosen to be nonnegative. Since a is uniquely determined by the 
other components, it can be ignored for the construction of the look-up table. 
For the sake of simplicity, let us suppose that the whole range [- 1,1] of the 
other three components is sampled using M values. This way we get a table 
T of M? values such that, at those locations for which b? + c? + d? < 1, the 
element taca is the needed value for the unit quaternion q = (a, b, c, d) for the 
corresponding nonnegative a. If b? +c? -- d? > 1, the value of ty.q is undefined. 
'This convention makes the table easily addressable, at the cost of some wasted 
memory. The sampling resolution M may simply be chosen based on memory 
considerations. For example, having M — 101 and using 8 bytes to represent 
real numbers, T consumes 8.24 - 109 bytes (about 7.86 megabytes). 

To find the desired d(qi,q2), we calculate q3 = (a3, b3,c3,d3) according 
to (13.9). If ag is negative, we replace q3 with —q3. Then d(q1,q2) is approx- 
imated by ty a4, where bs, c5, and d} are the sampling points nearest to bs, 
c3, and da, respectively. 


13.3 The Undeformed Case 


In this section we present an algorithm for grain map reconstruction. Note 
that in the undeformed case the orientation map follows from the grain map 
once the orientation of one pixel of every grain has been determined. 

Until recently, the following method has been used for grain map recon- 
struction [25]. A program called GRAINDEX, based on ray tracing, analyzes 
the diffraction spots on the detector and identifies subsets of those spots in 
such a way that all spots in a single subset originate from a single grain. 
GRAINDEX also provides the orientation of these grains, so in this unde- 
formed case there remains only the task of reconstructing the grain map. 
Data from the spots of a single subset form a real-valued vector b, and then 
an algebraic reconstruction technique (ART) [11] is applied to solve a system 
of linear equations Ax — b, where A is a real-valued matrix describing the 
experimental geometry and x is a 0/1-vector identifying the pixels in this 
particular grain. These separately reconstructed grains are patched together 
to form a grain map, but due to several reasons (e.g., noise in the data and 
the “real-valued” nature of the ART algorithm) one finds that there are many 
“ambiguous pixels" in such maps, i.e., some are assigned to multiple grains or 
to no grains at all. 
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The algorithm that we present in this section was originally designed to 
work on such reconstructed grain maps, which had a small number of “am- 
biguous pixels.” In this sense it is a restoration algorithm. Surprisingly, as 
demonstrated in the next section, we can successfully apply the algorithm to 
grain maps where initially only one pixel of each grain is known in advance, 
i.e., where all other pixels were considered as “ambiguous.” Since GRAIN- 
DEX provides such information, one can work directly on such low-quality 
grain maps without invoking the ART routine. Our exposition in this section, 
however, discusses an approach for assigning to grains those pixels that are 
not unambiguously assigned by ART. 


13.3.1 Algorithmic Overview 


The central idea of the approach is to take as input a 2D grain map f (gen- 
erated by ART as described above) and the diffraction data. Our proposed 
algorithm only changes pixels of the grain map that are classified “ambigu- 
ous.” The task is to assign these pixels to the “correct” grains. To this end, 
we utilize a Markov-chain-based Monte Carlo method, namely the Metropolis 
algorithm [21], which has proved to be useful in other applications of DT, such 
as the reconstruction of binary (or gray-leveled) images from projections [15]. 

We assume that the grain maps are random samples from a Gibbs distri- 
bution [5, 10] defined by 


a(f)- Ze PH), (13.13) 


where z(f) is the probability of occurrence of the grain map f, G > 0 isa 
parameter analogous to the inverse of the temperature, Z corresponds to the 
partition function in statistical mechanics, and H(f) is the energy of f. This 
energy H(f) is based only on local 3 x 3 features (called configurations) of the 
image. The local features of interest are partitioned into equivalence classes 


(for example, one of these is associated with edges) Go,..., Gc and 
C 
H(f) 2 - M, N(G., f)U., (13.14) 
c=0 


where N(G., f) counts the number of times a configuration from Ge occurs 
in f, and the number U, is the potential associated with the class Ge (for 
0 € c € C) [19]. These potentials are chosen so that random samples from 
the Gibbs distribution resemble the arrangement of grains in polycrystals. 
We also need to take the diffraction data P into account. (Such data can 
be modeled as a real-valued vector.) Let P ; denote the simulated diffraction 
data given the image f, and let o, 8 € R be given positive parameters. We 
apply the Metropolis algorithm to the nonzero-valued distribution y defined 
by 
— 1 .-6(#(/)+allPs-Pll,) 
F) = ze 


13.1 
P | (1315) 
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where ||-||, denotes the £;-norm. The reason for choosing y as in (13.15) is that 
we want to find the grain map f that minimizes H(f)+a ||P s — P||,. This can 
be provided within an adequate running time by the Metropolis algorithm. 

A single step of the Metropolis algorithm may be described as follows. 
One of the originally ambiguous pixels is randomly selected together with an 
alternative assignment of the pixel to a grain. Then we calculate p — W, 
the ratio of the probabilities of the new grain map f’ (which we obtain by 
the alternative assignment of the selected pixel) and the old grain map f. We 
accept the alternative assignment with probability min{1,p}. One cycle of 
the Metropolis algorithm consists of n Metropolis steps, where n is the total 
number of originally ambiguous pixels in the grain map. 

Note that our Gibbs distributions are defined on multicolored images. How- 
ever, in order to keep the number of parameters low, we compute H(f) by 
suitably resorting to binary configurations, as explained in the next subsec- 
tion. 


13.3.2 Algorithmic Details 


For each pixel in the multicolored grain map, we define its local configuration 
as a 3x3 array of black and white pixels as follows: The central pixel is always 
white and any other pixel is white if, and only if, it belongs to the same grain 
as the central pixel. These configurations are partitioned into seven classes 
Go, G1,...,G6, each containing configurations of similar morphology, such as 
“grain interior," “grain edge,” etc. For 1 < 4 < 6, the class G; consists of the 
configurations illustrated in Fig. 13.3 and all the configurations that can be 
obtained from it by a sequence of 90° rotations around the center and mirror 
images about the central vertical line. Configurations not in any of G1,..., G6, 
are put into Go. 








Gi GQ G3 G4 Gs Gs 
Fig. 13.3. The configurations G1, ..., Ge of a 3x3 clique that we use in our model. 










































































Next we approximate the distribution of grain maps by a Gibbs distribu- 
tion (13.13). For this we need to define the potentials U; of (13.14). We use 
an approach based on counting 3 x 3 configurations in typical microstructures 
(the training set) for determining the potentials. 

The restoration algorithm seeks to maximize (13.15). Both a and 6 are 
system parameters to be optimized by simulations. The maximization is 
performed by the Metropolis algorithm [21], which is directly applicable to 
(13.15). 
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Evidently, different samples will give rise to different estimated Gibbs dis- 
tributions. It would be cumbersome, in some cases even impossible, to estab- 
lish the required potentials prior to every 3DXRD experiment. However, grain 
microstructures are, broadly speaking, similar (e.g., grain growth is often as- 
sociated with self-similar patterns). Hence, we predict that the algorithm is 
sufficiently robust with respect to the choice of the potentials, and so one set 
of parameters can be used for a large set of samples. 

To test the robustness of our algorithm, we used six different sets of po- 
tentials U* = (U1, U2, U3, Us, Us, Ug), where Up was always set to 0: 

Ut = (1.4, 0.71, 0.61, 0.79, 0.5, 0.61), a medium-scale grain structure; 


U? = (1.4, 0.8, 0.8, 0.94, 0.19, 0.82), a coarse-grained structure; 

U? = (1.4, 0.94, 0.94, 1.09, 0.12, 1.01), a fine-grained structure; 

U4 = (1.4, 0.91, 0.91, 1.08, 0.02, 1.0), a partially recrystallized sample; 
U® = (1.5, 1.2, 0.84, 1, 1.25, 0.6) displays grain-like features; 


U® = (0.5, 0.4, 1.0, 0.8, 0.1, 0.6) does not produce grain-like features. 

The potentials U1,...,U? were determined by methods from [6] using a 
training set of aluminum grain samples; U® was chosen arbitrarily. Unless 
stated otherwise, we use the Gibbs prior defined by U?. 


13.3.3 Results 


Figure 13.4 shows one aluminum grain map of 128 x 128 pixel size containing 44 
grains (left image), and its reconstruction using ART (right image). The right 
image contains 1490 white pixels, which correspond to ambiguous points that 
ART was unable to reconstruct. Our approach tries to resolve the ambiguous 
white pixels; an ideal output of our algorithm would be the retrieval of the left 
image. We remark that the ART reconstruction is not perfect; even among the 
nonwhite pixels, 682 pixels are incorrectly assigned in Fig. 13.4 (right). Since 
our algorithm processes only white pixels, there is no possibility of correcting 
those wrong assignments. 

Taking the original image, we simulated noiseless detector data and ran our 
algorithm on the image with ambiguities using a 3 GHz Pentium 4 processor. 
Initially, the white points were randomly assigned to the grains that surround 
the white region in which they occurred. The size of the detector (in pixels) 
was 1024 x 1536, and we used 91 projections provided by equally spaced 
rotation angles w between —45 and 45 degrees. In our simulations, it turned 
out that only roughly 0.02% of the pixels on the detectors were actually hit 
by diffraction beams (resulting in a nonzero value), and most grains produced 
approximately a total of eight spots on the detector. Every diffracted beam 
recorded on the detector hits only one detector pixel and adds a uniform 
amount to its intensity. 

Based on the set of Gibbs potentials U+, the diffraction images, and the 
ambiguity areas defined above, a series of restorations was made with varying 
system parameters a, 3, the number of Monte Carlo cycles (MCC), as well 
as the additive noise term. The noise was implemented by adding a value of 
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Fig. 13.4. The left image shows a typical grain map, which was experimentally 
determined by electron microscopy. The right image shows the grain map (produced 
by ART) where ambiguities have to be resolved (white points). 


1 to randomly chosen detector pixels. In the following, noise levels will be 
indicated in percent as the ratio of the total number of added 1’s to all the 
detector pixels. The quality of the restorations is determined by the “number 
of errors,” which is the number of pixels in the white areas that have been 
assigned to a grain in the restored map that is different from the grain in the 
original map. To improve the statistics, each simulation was repeated 10 times 
with different initial seeds to the Metropolis algorithm. The restorations were 
found to converge rapidly. The running time for each restoration (based on 
1000 MCC) was 10 seconds. 

The variation in the number of errors as a function of a and ( using the 
Gibbs prior associated with U! is shown in Fig. 13.5. For noise levels clearly 
below 100% — as expected for the level of noise in real 3DXRD experiments 
— a broad optimal range is around (a, 8) = (1,1). In the experiments that 
follow, we use these values. 

The variation with noise of such optimized restorations is compared to 
results for projections only in Fig. 13.6. “Projections only” implies no use 
of Gibbs priors, i.e., based on (f) = ge lPs-Pil, instead of (13.15). Also 
shown is the result of a pure 2D-ART reconstruction based on the simulated 
diffraction images. Ambiguous pixels were in this case allocated to the grain 
yielding the highest rational value among the single-grain reconstructions. 
There are significant differences in performance among the three methods. 

Based on the six sets of potentials U1,...,U, but otherwise with the 
same settings of system parameters (a = 6 = 1), restorations were made 
to the grain map shown on the right in Fig. 13.4. The results are shown in 
Fig. 13.7. Evidently, the quality of reconstruction based on the five sets of 
potentials related to grain-like features is nearly identical. This is seen as 
a strong indication that reconstructions of reasonably similar samples with 
unknown microstructures can be based on a set of Gibbs potentials derived 
from electron microscopic investigations of one representative. 
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Fig. 13.5. Optimization of system variables o and 8. For various (a, 3), the varia- 
tion in restoration quality—determined as the total number of ambiguous pixels in 
the reconstructed map assigned to the wrong grain—is shown as a function of the 
noise in the simulated detector images. 
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Fig. 13.6. Reconstruction quality as a function of the amount of additive noise in the 
simulated detector images. Two map restoration methods are compared: restoration 
from projections only and restoration from projections plus the Gibbs prior based 


on U!. The result for reconstruction from noiseless data based purely on 2D-ART 
is also indicated. 
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Fig. 13.7. Test of robustness. The quality of restoration is shown as a function of 
noise for six sets of Gibbs potentials U!, U?, U?, U*, U?, and U$. 


Additionally, we investigated the effect of varying the average number 
of useful diffraction spots per grain (at a noise level of 0% and 100%; see 
Fig. 13.8 left and right, respectively). The number of spots was varied by 
removing spots from the simulated detector images. The removal was carried 
out arbitrarily, by terminating the spot simulation (favoring no particular kind 
of spots) after a prescribed amount of spots occurred on the detector. Again 
the superior quality of the mapping based on the restoration algorithm with 
Gibbs priors over the pure 2D-ART approach is evident. 
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Fig. 13.8. Reconstruction quality as a function of the average number of useful spots 
per grain. Left: At a noise level of 0%, two map restoration methods are compared 
with pure 2D-ART: restoration by projection only and restoration by projections 
plus the optimized Gibbs priors. Right: Results for a noise level of 100%. 


Finally, we tested the extreme approach in which simulations were re- 
peated with an initial grain map with all the 10,299 (except for 44) nonblack 
pixels of the grain map in Fig. 13.4 assigned to be white points. The 44 as- 
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signed pixels corresponded to 1 pixel per grain—namely the pixels closest to 
the individual centers of mass. These pixels may be thought of as seeds for 
the algorithm. The running time was six times longer than for the results 
presented before. The total number of incorrectly assigned pixels in the three 
cases of reconstruction from noiseless data with Gibbs priors, “Projections 
only” and 2D-ART, respectively, are 120, 586 and 5482. Notably, the total 
number of errors is smaller than the corresponding result with an initial map 
provided by 2D-ART, while the number of errors in the ambiguous parts is 
larger. The results for the 100% noise case are much worse than those ob- 
tained using an initial map, due to the fact that the Metropolis algorithm 
gets stuck at a local maximum, which might be resolved by vastly increasing 
the number of MCC. These results indicate that an optimal trade-off between 
the accuracy of initial map and the presented method with respect to running 
time, noise, and reconstruction quality has to be found. This optimum may 
depend on specifics of the grain map and is a topic of current research. 


13.4 The Moderately Deformed Case 


Now we proceed toward the more general problem of reconstructing orienta- 
tion maps of deformed samples. This procedure is of high relevance for many 
industrial applications that seek to assess the effects of structural deforma- 
tion and of stress on the physical and mechanical properties of polycrystalline 
materials. The ultimate goal is to obtain orientation maps of highly strained 
specimens by means of reconstruction, but this turns out to be a complicated 
task. Instead, we restrict the discussion to moderate deformations. We have 
coined this term to indicate levels of deformation at which it is possible to 
deduce the (deformed) grain map from the orientation map. This restriction 
is expected to allow reconstructions of acceptable quality even under unfa- 
vorable circumstances (e.g., high background radiation level or low photon 
count). 

The approach presented below is again a kind of restoration algorithm 
that is an extension of the technique introduced in Section 13.3. The method 
was originally designed as a “post-processing” tool that can recover the com- 
plete orientation map from a partially reconstructed map obtained via some 
other procedure, but it turned out that it can be successfully utilized as a 
reconstruction algorithm. 


13.4.1 Algorithm Overview 


The aim is to reconstruct simultaneously a 2D orientation map/grain map pair 
of a cross section of the specimen from a set of diffraction patterns acquired 
by 3DXRD. 

The algorithm described here is strongly motivated by the technique dis- 
cussed in Subsection 13.3.1. In particular, one is looking for an orientation 
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map whose simulated projections approximate the measured ones as closely 
as possible, and for a grain map that exhibits typical features of moderately 
deformed polycrystals. As mentioned in Section 13.2, the classic way to con- 
struct grain maps from deformed orientation maps (as derived from electron 
microscopy) employs connected component techniques. These are based on 
the fact that orientations of neighboring pixels within any grain are related 
to each other, as can be observed in real-life polycrystals. In other words, 
the orientation map reveals a sort of homogeneity or smoothness inside every 
grain, and we make use of this principle. 

Our approach is superior to the techniques just mentioned in the sense 
that it is assisted by 3DXRD diffraction measurements, hence making non- 
destructive dynamic studies feasible. Also, in comparison to, e.g., ART-based 
3DXRD reconstructions, the solutions are determined by using the original 
(unaltered) projections, that is, diffraction spots need not be discarded in case 
of spot overlap. The latter may prove useful for larger grain maps and for 
higher levels of deformation, since these usually give rise to many overlapping 
diffraction spots. 

As before, our aim is to associate with each pixel i € D C Z? in the 
area of interest both a grain label f(z) € {1,...,/} and an orientation o(i) 
represented by a unit quaternion (see Subsection 13.2.1). The notation f; will 
be used to designate the set {i € D | f(i) = ¢} of indices of all the pixels 
associated with the grain labeled £ € (1,...,1). In addition, the special grain 
label 0 will also be employed during the course of reconstruction in order to 
indicate ambiguous pixels whose grain membership is not yet decided. 

Two more concepts need to be established before proceeding. Recalling 
the definition of distance between orientations, cf. (13.11) and (13.12), the 
average orientation of a set of orientations represented by the unit guaternions 
qı,- - -, Qn is represented by a unit quaternion q that minimizes » 7,. , d(qr, q). 
The orientation spread of grain f; is defined as the maximal distance between 
the average orientation of f; and the orientation of any pixel constituting fe. 

'The following requirements have to be satisfied for a successful application 
of the algorithm: 


(a) The number l of grains should be fixed and known beforehand. 

(b) An initial guess of average orientations, orientation spread, centers of 
mass, and approximate morphologies of grains should be available. A pro- 
gram called GRAINSWEEPER [24] is under development; its specific pur- 
pose is to provide such information on grain maps and orientation maps 
via the analysis of 3DXRD diffraction patterns. 

(c) The statistical distribution of typical grain morphologies in moderately 
deformed orientation maps is available. 


As explained at the end of Subsection 13.2.2, orientations are discretized 
using an appropriate quantization step. (This is not only an issue of memory 
requirements or reconstruction speed. Specifically, it is a fundamental prop- 
erty of 3DXRD that the obtainable spatial resolution—with respect to grain 
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position inside the specimen — and the angular resolution — related to the 
precision with which orientations can be measured — of detectors are not 
independent. See [24] for a deeper discussion.) This effectively means that the 
set of possible orientations is known and finite. Since there are finitely many 
grains in the cross section, the problem becomes a DT reconstruction task. 

Our reconstruction procedure is an extension of the Bayesian technique 
described in Section 13.3 for the undeformed case, but was also inspired by 
a similar approach for gray-valued images described in [7]. Since the orienta- 
tion of a pixel within a particular grain is likely to be similar to that of its 
neighbors, and all moderately deformed grain maps show similar morpholog- 
ical features (depending on the material and the magnitude of deformation, 
of course), we decided to model both maps simultaneously by a Gibbs distri- 
bution. This has the advantage that only local features need to be specified, 
making the description compact and the algorithm efficient. 

The probability of occurrence of a grain map/orientation map pair (f, o) 
is given by, similar to (13.13), 


1 
z(f,o) = ZO (13.16) 


It is the construction of the energy functional H(f,o) that carries the de- 
sired properties of a grain map f and of an orientation map o. As presented 
more precisely below, this functional is calculated as a weighted sum of clique 
potentials. It is defined as 


H(f,o) = Hi(f,o) + HX(f). (13.17) 


The first term establishes a homogeneity condition: 


l 


Hi(f,o) =-X | M AGc(o)+ M rvc(o) | , (13.18) 


£—1 C€C,, CECy 2 
with 
_ (d(o(é),0(5)))? 
Bujo) se s - (13.19) 


By C,¢ we denote the set of all horizontal and vertical pair cliques (i.e., pairs 
of pixel indices) within grain fe, while Cx, is that of all diagonal pair cliques 
inside grain fe. The real-valued coefficients A; and Az determine the contri- 
bution of each type of interaction, and the free real parameter 6 > 0, together 
with the A;, controls the degree of homogeneity. (In fact, 6 is related to the 
maximal orientation spread over all the grains, or, more explicitly, to the max- 
imal distance between the orientations of adjacent pixels within any grain.) 
As discussed in Subsection 13.2.2, the nonnegative function d(o(i), o(j)) mea- 
sures the distance of two orientations; a smaller value indicates more similar 
orientations [see (13.12)]. 
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The second term in H(f,o), namely H2(f), models the borders between 
neighboring grains. That is, the purpose of this term is to capture typical 
grain-like features of moderately deformed maps. We used an H2(f) that is 
defined exactly as in (13.14). As demonstrated in Subsection 13.3.3, the effects 
of such a term are very robust with respect to the choice of clique potentials. 
Moreover, moderately deformed grains, by definition, used to be undeformed 
before deformation. As the morphology changes during deformation are rel- 
atively small, we claim that the set of configurations shown in Fig. 13.3 is 
appropriate for moderately deformed grain maps, too. The clique potentials 
U! introduced in Subsection 13.3.2 were used for implementation. 

Having defined the desired distribution of grain maps and orientation 
maps, we can now formulate the posterior distribution in terms of the physical 
measurements. Analogously to (13.15), we end up with the goal of maximizing 
the objective functional 


en re eP Gejt Ptal PeP), (13.20) 


[Note the replacement of P ; in (13.15) by Po. This is necessary, since orien- 
tation is not unambiguously determined by the label in the deformed case.] In 
the present implementation, the Metropolis algorithm [21] is being employed 
for this purpose. A major advantage of this iterative Monte Carlo approach 
is that it does not need the value of y(f, 0) itself during optimization. 

At every iteration of the Metropolis algorithm, a new pair (f^, o") of maps 
is devised from the current ( f, o) as described ahead. The new pair is accepted, 
as explained in Subsection 13.3.1, based on the ratio: 





Soo) 


The new approximations (f^, o") are constructed according to three prin- 
ciples: 


(f! o) eo muore (IP, -PII,-IIP.-PII)) (13.21) 


(a) An ambiguous pixel may only be given the label and orientation of an 
adjacent nonambiguous pixel. 

(b) A nonambiguous pixel may retain its association with a grain while its 
orientation may be replaced with one of its nearest neighbors in orientation 
space. 

(c) A nonambiguous pixel at a grain boundary may be given the same label 
and orientation as a neighboring pixel associated with an adjacent grain. 


More specifically, ( f^, 0’) is generated from (f, o) as follows: 
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Let f'(i):— f(i) and o'(i) := o(i) for all i € D; 
Randomly choose a pixel i € D; 
if at least one neighbor of i is nonambiguous 
Randomly choose a nonambiguous neighbor, pixel j; 
if pixel i is ambiguous 
Set f'(i) = f(j) and o'(i) = o()s 
Accept (^, o") unconditionally; 
else 
if f()-f) 
o' (à) :— q, where q is one of the orientations that are adjacent 
to either o(i) or o(j) in orientation space; 
Decide on whether to accept (f^, o); 
else 
PO = SO), o) := ol); 
Decide on whether to accept (f^, 0’); 
endif 
endif 
endif 


'The optimization procedure can, in theory, be started from an arbitrary 
pair (f,o) of maps, but f is usually initialized so that most of the pixels are 
assigned the ambiguous label. The exact way of their construction is discussed 
in the next subsection. 

The relaxation parameter 0 (inverse of the temperature) is used to control 
the acceptance ratio: A lower @ increases the probability of accepting (f’, 0’) 
even when the value of the objective functional declines, while a higher 6 
“freezes” the system into accepting only those (f’,o’) that result in an im- 
provement of 4( f, o). The results we are going to report in the next subsection 
were produced by keeping 8 constant all the time, and stopping optimization 
after a certain number of iterations. This is a trade-off between adequate run- 
ning speed and the possibility of unwanted local optima. Considering the good 
quality of the present reconstructions, we postulate that only a slight improve- 
ment would be gained by employing more elaborate techniques such as the 
Gibbs Sampler [10] in conjunction with simulated annealing (SA) [10, 16]. 


13.4.2 Results 


'The algorithm described above has been implemented as a program written 
in C. All the timings mentioned later were measured under Linux on a state- 
of-the-art PC having 1.25 GB RAM and a 2.8 GHz processor. 

Stochastic methods are renowned not only for their ability to easily solve 
complicated problems, but also because they tend to be slow as compared to 
deterministic approaches. Therefore, considerable effort has been invested in 
optimizing the program code. For reference purposes, we summarize some of 
these. 


(a) As explained in Subsection 13.2.2, the continuous orientation space is 
discretized into finitely many quantized orientations. 

(b) Several look-up tables are utilized throughout the program to store pre- 
computed values of certain functions whose evaluation should be as fast 
as possible. One example is the table containing approximate values of 
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d(qi, q2), described in Subsection 13.2.2. In addition, the potentials asso- 

ciated with any 3 x 3 binarized grain map configurations are stored in a 

9D look-up table. 

(c) The objective functional y(f,0) needs to be completely evaluated only 
once, right before optimization is started. Whenever a pair (f’, o’) of maps 
is to be tested for acceptance as the new approximation of the optimal 
maps [with (f, o) being the current pair of maps], y(f’, 0’) is computed by 
updating ^(f, o) according to the changes caused by switching from (f, o) 
to (f^, o'). In particular, terms Hi(f,o), Hi(f', 0’), Ho(f), and H2(f’) are 
affected by only those 3 x 3 cliques that contain the pixel ? being modified. 
(There are only 9 such cliques.) Furthermore, the simulated projections 
Pv, and thus the error ||P, — P||,, can be directly derived from Po by 
subtracting the projections on the detector generated by the old orien- 
tation o(i) and then by adding those obtained with the new orientation 
o' (i). 

Numerous simulations have been performed to optimize the free param- 
eters of the algorithm and to quantitatively characterize the quality of the 
reconstructions as functions of magnitude of orientation spread within grains, 
those of degree of morphological complexity of grain maps, and of the artificial 
noise. 

Four 64 x 64-pixel test maps of aluminum samples of varying complexity 
were used. In all cases, the reference orientation map was generated by elec- 
tron microscopy; these maps are depicted in the upper left corners of Figs. 13.9 
to 13.12, respectively. They represent different complexities in terms of the 
number of grains and of the orientation spread inside grains: 


Case I: This map comprises 11 grains; on average, the orientation spread 
within grains is 0.0003427 (i.e., equivalent to the disorientation angle of 
= 3°), while one grain has a spread of 0.0018652 (7°). 

Case II: This map comprises 26 grains; the average orientation spread is 
around 0.0018652 (7°), but the spread within grains varies from 0.0003427 
up to 0.0183728 (3?—22?). The subdivision of some grains into subgrains 
is also noticeable. 

Case III: This map comprises 3 grains; the average orientation spread is 
0.0074538 (14°). 

Case IV: The material is, in this case, too deformed to comply with being 
“moderately deformed” as defined in the introduction of this section. 
Grains cannot be identified in an unambiguous way, and the orientation 
spread sometimes surpasses 0.0109841 (17°). This case is included to test 
the limitations of the algorithm. 


We remark that the gray scale used in Figs. 13.9 to 13.14 makes it difficult to 
see the orientation differences inside the grains for Cases I-III. 

For each case, first a reference grain map was determined from the ref- 
erence orientation map via a connected component technique. The next step 
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Fig. 13.9. The reconstruction of Test Case I using noiseless projections. (a) Ref- 
erence orientation map. (b) Initial orientation map. (c) Reconstructed orientation 
map. (d) Difference of the reference and the reconstructed grain maps. Black pixels 
denote identical grain labels; white pixels represent mismatching ones. (e) Differ- 
ence of the reference and the reconstructed orientation maps. The intensity of the 
pixels is determined by the distance (rotation angle) of corresponding orientation 
pairs, as shown in (f). 




















consisted of the determination of the center-of-mass pixel and the average 
orientation of each of the grains. These were later used to initialize (f,0) by 
assigning a grain label and the corresponding average orientation to every 
center-of-mass pixel (called seed points), and by setting the labels of the re- 
maining pixels to ambiguous. (The orientations of the latter were simply left 
undefined.) This procedure was meant to mimic the minimal outcome of an 
external method (e.g., GRAINSWEEPER [24]) to be used to retrieve such in- 
formation by the preprocessing of 3DXRD diffraction patterns. For Case IV, 
somewhat arbitrarily a set of 31 seeds was determined. 

Next for every test case, 3DXRD diffraction patterns associated with the 
reference orientation maps were simulated, optionally applying some synthetic 
noise as well. We generated 91 projections corresponding to equally spaced 
rotation angles from the interval [—45°, 45°]. The acquisition set-up imitated 
that of the 3DXRD microscope at the European Synchrotron Radiation Fa- 
cility (ESRF) at the time. 
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Fig. 13.10. The reconstruction of Test Case II using noiseless projections. Map 
arrangement and gray scales as for Fig. 13.9. 


In order to come close to the quality of real diffraction patterns, varying 
levels of artificial noise were applied to the simulated projections. The distor- 
tions of measurements are caused by different kinds of phenomena: scattering, 
fluctuation of the intensity of the X-ray beam, statistical error (the so-called 
Poisson or quantum noise), cross-talk between neighboring detector pixels, 
etc. Out of these, perhaps the Poisson noise is the most prominent; there- 
fore, only this sort of deviation was taken into account. Let us suppose that 
the intensity (photon count) Jp of every detector pixel is to be distorted by 
L% of noise. The noisy intensity noisy was then, as an approximation to the 
Poisson distribution, defined as a uniformly distributed random number taken 
from [Io (1 — is) , To (1 + o)l; subject to the constraint of nonnegativity. 
A trivial property of such noise is that Inoisy = 0 whenever Jp = 0. It should 
be stressed that this definition of noise is considerably different than that used 
for the simulation experiments in Section 13.3 (and in [1, 2]), so the results 
and conclusions cannot be directly compared. 

The Gibbs potentials were fixed to the values U introduced in Subsec- 
tion 13.3.2. The ratio Su was also held constant at v2, as suggested in [7]. The 
only remaining free parameters of the objective functional were the relaxation 
scalar B, the two weights A; and a that determine the relative importance of 
the three terms in (13.20), and the measure 6 of the orientation spread ap- 
pearing in (13.19). 
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Fig. 13.11. The reconstruction of Test Case III using noiseless projections. Map 
arrangement and gray scales as for Fig. 13.9. 


Based on the four test cases, associated input projections, and initial maps 
as defined above, a series of reconstructions was performed with varying sys- 
tem parameters a, 3, 41, and number of Metropolis steps, as well as different 
noise levels. (The value of 6 was always guessed from the reference orientation 
map and kept constant all the time. In particular, ô was set to 0.005 [11.5?], 
0.01 [16.2°], 0.008 [14.5?], and 0.021 [23.5?] for Test Cases I-IV, respectively.) 

The quality of the results were measured by two figure-of-merit functions, 
FOMa and FOMo. 


M 
IDP 
where M denotes the number of mismatching grain labels between corre- 
sponding pixels of the reference and reconstructed grain maps, and |D| is the 
number of pixels. The second figure-of-merit is related to the distance between 
the original orientation o?'5(;) and the resulting orientation o'**(i) for each 
pixel i: 


FOMg :=1-— (13.22) 


1 ; 
A aD D (o5), o**(à)) , (13.23) 
max ED 
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Fig. 13.12. The reconstruction of Test Case IV using noiseless projections. Map 
arrangement and gray scales as for Fig. 13.9. 


where, for i € D, o°8(i) and o'**(i) denote the orientations at pixel i in the 
original and in the reconstructed maps, and dmax represents the maximal pos- 
sible value of d(-,-), which equals about 0.1464466 (62.8?) for the face-centered 
cubic lattice. Evidently, FOMaG = FOMo = 1 for perfectly reconstructed 
maps, while FOMq z 0 and FOMo z 0.5 for a random reconstruction. 

Reconstructions tended to converge quite rapidly, needing not more than 
2.5 million iterations (corresponding to about 8 minutes of computer time, and 
to approximately 600 MCC as defined in Subsection 13.3.3). The variations of 
the FOMs with respect to the aforementioned free parameters turned out to 
be small, implying that the algorithm is robust. For the following, the values 
a= B = X1 = 1 have been employed. 

Figures 13.9 through 13.12 present the results for the four test cases using 
ideal diffraction patterns. The qualities of the reconstructions in the first three 
cases are all very good, with both FOM values being at or above 0.99. Re- 
markably, a high-quality orientation map—with FOMo = 0.986—is derived 
also for Case IV. (In this case, the grain map is deteriorated and thus FOMG 
is irrelevant.) 

Similar reconstructions for 10096 of noise are shown in Figs. 13.13 and 
13.14, and figures-of-merit are summarized in Fig. 13.15. (The latter plot was 
acquired by repeating every reconstruction 10 times using different seeds for 
the pseudo-random number generator, where the error bars indicate the stan- 
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dard error due to this variability.) The noise in experimental data is estimated 
to be of order 10%, so the effect is clearly exaggerated in these simulations. 
Nevertheless, the FOM values of the reconstructions remain high. 





(a) (b) 
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(e) 


Fig. 13.13. The reconstruction of Test Case I at 100% noise level. (a) Reference 
orientation map. (b) Reconstructed orientation map. (c) Difference of the reference 
and the reconstructed grain maps. Black pixels denote identical grain labels; white 
pixels represent mismatching ones. (d) Difference of the reference and the recon- 
structed orientation maps. The intensity of the pixels is determined by the distance 
(rotation angle) of corresponding orientation pairs, as shown in (e). 
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Fig. 13.14. The reconstruction of Test Case II at 100% noise level. Map arrange- 
ment and gray scales as for Fig. 13.13. 




















The impact of terms Hi(f,0) and Hə(f) was investigated by performing 
some experiments for Case II when one or both of these terms had been 
disabled. To get reliable statistics, all simulations were repeated 10 times 
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Fig. 13.15. Quality of the reconstructions as the function of noise level. (a) FOMc 
versus the level of noise. (b) FOMo versus the level of noise. 


using different seeds for the pseudo-random number generator. The results 
are presented in Fig. 13.16. It is evident that including at least one of the 
additional terms is beneficial at higher noise levels. 
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Fig. 13.16. Importance of various terms of the objective functional as the function 
of noise level. (a) FOMa versus the level of noise. (b) FOMo versus the level of 
noise. (Legend) Dotted: all terms; solid: Hi(f,o) + projections; dashed: H2(f) + 
projections; dashdot: projections only. 
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13.5 Discussion and Outlook 


We have demonstrated that the discrete tomography approach yields substan- 
tial improvements over the continuous approach (such as ART) when grain 
map and orientation map reconstructions are desired. The discrete nature of 
the reconstruction task was exploited in two ways: 


(a) By introducing labels and discrete grain maps. This has reduced the size of 
solution space by orders of magnitude, thereby enabling the use of stochas- 
tic routines. We showed that we provide reconstructions of a quality that 
is clearly superior to previous approaches. The stochastic routines do need 
a priori information in terms of initial maps. However, this requirement is 
not that demanding; for the case of undeformed grains, even the output 
from an indexing program may do. 

(b) By use of priors. To enable this, the methodology has been generalized 
from a two-phase (binary) to multiphase systems. The priors were shown 
to provide better maps in the cases where the reconstructions based only 
on the projections deteriorated (few projections, high noise, initial maps 
with little information). 


Work on reconstructions of real experimental data is currently pending for 
both the undeformed and the moderately deformed cases. 

The presented work serves as a basis for further improvements. We con- 
clude with briefly stating some important further directions. 


(a) The presented reconstructions were 2D. Three-dimension reconstructions 
are also of great interest; in producing them we will make use of (3D) 
Gibbs priors. 

(b) It is unclear how to make a trade-off among the accuracy of the initial 
grain map (with ambiguous areas), the running time of the restoration 
algorithm, and the actual noise level. 

(c) Real samples are usually not rectangular, so some pixels in the maps may 
correspond to air where there is no material to cause diffracted photons. 
The current implementation has some limited support for such void pizels, 
but their full-fledged handling will be a subject of future development 
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Discrete Tomography Methods for 
Nondestructive Testing 


J. Baumann, Z. Kiss, S. Krimmel, A. Kuba, A. Nagy, L. Rodek, 
B. Schillinger, J. Stephan 


Summary. The industrial nondestructive testing (NDT) of objects seems to be 
an ideal application of discrete tomography. In many cases, the objects consist of 
known materials, and a lot of a priori information is available (e.g., the description 
of an ideal object, which is similar to the actual one under investigation). One of 
the frequently used methods in NDT is to take projection images of the objects by 
some transmitting ray (e.g., X- or neutron-ray) and reconstruct the cross sections. 
But it can happen that only a few number of projections can be collected, because 
of long and/or expensive data acquisition, or the projections can be collected only 
from a limited range of directions. The chapter describes two DT reconstruction 
methods used in NDT experiments, shows the results of a DT procedure applied in 
the reconstruction of oblong objects having projections only from a limited range of 
angles, and, finally, suggests a few further possible NDT applications of DT. 


14.1 Introduction 


Many industrial applications need a procedure to get information about the 
structure of the object to be investigated in a nondestructive manner. X-ray 
tomography is such a technique, performing reconstructions of cross sections 
of the object from X-ray transmission projections. Nevertheless, the acqui- 
sition of such projection images can be an expensive and time-consuming 
procedure, and it is also possible that projections from certain directions can- 
not be taken. It is therefore important to be able to reconstruct from as few 
views as possible. An approach to achieving this is the application of discrete 
tomographic (DT) methods, in which only a special class of objects can be re- 
constructed, namely those comprising only a few homogeneous materials that 
can be characterized by known, discrete absorption values. Accordingly, the 
result of a DT reconstruction is an image having values only corresponding 
to the few known absorption coefficients. An overview of theory, algorithms, 
and applications of DT can be found in [4]. 

A substantial difference between classical computed tomography (CT) and 
discrete tomography (DT) is that in the latter it is assumed that the range of 
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the image function consists of finitely many known values. Moreover, in many 
cases, some a priori information is also available about the object under 
investigation; for example, the structure of the object is similar to that of 
a given template. An important application of discrete tomography is the 
industrial nondestructive testing (NDT), in which the internal structure of a 
specimen is to be determined without causing any damage. 

This chapter describes DT reconstruction algorithms used in NDT in Sec- 
tion 14.2. Also, the test results of the simulation and physical experiments 
are presented. In Section 14.3, we show that the application of these DT 
methods is suitable, for example, to increase the possible inspection size of 
single-material oblong objects. The increase of the object’s size with DT recon- 
struction compared to filtered back-projection (FBP) used in X-ray computed 
tomography (CT) was estimated to be above 50%. Finally, in Section 14.4, 
we suggest some further possible NDT applications of DT. 


14.2 Reconstruction of Pixel-based and Geometric 
Objects 


This section introduces two kinds of DT techniques, which can be applied to 
reconstruct two-level and even multilevel images (i.e., their range contains only 
two or even more than two values, respectively) from their projections. Both 
algorithms consider the reconstruction problem as an optimization problem 
to be solved by a method called simulated annealing (SA). The difference be- 
tween the reconstruction algorithms is the representation of the object under 
investigation. 

One of the reconstruction techniques is a pixel-based method that con- 
siders the object as a set of pixels, i.e., as a usual digital image. The main 
advantage of the pixel-based method is that it is general in the sense that it 
can be used for reconstructing any shape. This method is described in Sub- 
section 14.2.3. 

The other algorithm reconstructs objects consisting of circles, cylinders, 
and spheres made of homogeneous materials only. Such 2D and 3D geometric 
objects can be represented by a few parameters like radii, positions, heights, 
etc. In this case, the reconstruction means the determination of the parameters 
of the geometric objects; in other words, the optimization carries out a search 
in the space of parameters. This parameter-based method is presented in 
Subsection 14.2.4. 

In order to assess the efficiency of these techniques, we performed sev- 
eral simulation experiments. We were also interested in how certain recon- 
struction parameters (e.g., the number of projections) or the amount of noise 
affect the reconstructed image. These results and conclusions are shown in 
Subsection 14.2.5. We had the opportunity to test both algorithms also for 
reconstructing real, not necessarily binary, objects. The results of these X-ray 
experiments are given in Subsection 14.2.6. 
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'The physically measured projections are not suitable for immediate re- 
construction due to several properties and defects of the image acquisition 
system (e.g., nonuniform sensitivity during acquisition and on the detector 
plate, bright specks, presence of statistical noise, etc.). Hence, some prepro- 
cessing steps are necessary to reduce these effects. These correction steps are 
summarized in Subsection 14.2.6. 

'The algorithms presented here have been incorporated into the system 
called DIRECT (discrete reconstruction techniques) [17], which is a frame- 
work for testing and visualizing various DT methods. For more details about 
implementation, see [7, 10]. 


14.2.1 Reconstruction as an Optimization Problem 


'The usual tomographic imaging procedure is to collect projections of the ob- 
ject using some transmission rays, like X- or neutron rays. The rays transmit- 
ted through the object are partially absorbed by the materials comprising the 
object. The relation between the initial and transmitted (unabsorbed) ray in- 
tensities, /s and Ip, respectively, can be expressed as a function that depends 
on the absorption coefficient (u) of the object, that is, 


D 
- f n(u)du 
Tei ates ti (14.1) 


where the integral is taken on the line between the source (S) and detector 
(D). This equation is a basic relation in transmission tomography, in which 
the cross sections of the object are to be determined from such measurements. 

Mathematically, transmission tomography is modeled by the Radon trans- 
form, Rf, which gives the line integrals of a two-dimensional integrable func- 
tion f. Formally, 


[Rf] (8,9) = / f(x, y) du , (14.2) 


where s and u denote the variables of the coordinate system rotated by 90. 
The function Rf for a fixed value of J is also called the -angle projection 
of f. 

Returning to the physical model of transmission tomography, hereafter let 
f denote the absorption coefficient of the 2D object being studied. Then the 
Ü-angle projection of f, [R.f](s, 9) can be computed from the transmission 
measurements, Ip(s, V) after a suitable logarithmic transform. That is, we 
are looking for an f such that 


Is 
R 8) = log ———— . 14.3 
[RF] (s.9) = log 1 (14.3) 
Now the reconstruction problem can be posed as one where the goal is 
to find a function f such that its projections are equal to the right side of 
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(14.3). In other words, we are looking for the inverse of R. The function f is 
sometimes called the image function or, briefly, the image. 

Let Ps(s) denote the acquired projection of angle 0 [i.e., the right side of 
(14.3)] as a function of s for a given direction 9. Similarly, let [Rf] (9) denote 
the projection of the image f taken at angle 9, which is also a function of 
s. Both methods to be presented consider the reconstruction problem as an 
optimization task that minimizes the objective functional 


C(f) = X IRF (9) — Pol? , (14.4) 
D 


where ||.|| denotes the usual Euclidean norm. So, using the formalism of (14.4), 
the aim of this optimization is to find the image function f, whose correspond- 
ing projections are the most suitable for the input data. 

'To solve the optimization tasks associated with the different kind of rep- 
resentations, we used the method of simulated annealing (SA). 


14.2.2 Simulated Annealing (SA) 


Simulated annealing (SA) is a random-search technique that is based on the 
physical phenomenon of metal cooling. The system of metal particles gradually 
reaches the minimum energy level where the metal freezes into a crystalline 
structure. 

The algorithm (see Fig. 14.1) starts from an arbitrary initial binary image 
a = aO and an initial (high) temperature T), and calculates the objective 
functional value C(x). Then a position j is randomly chosen in the recon- 
structed image x. Let z' be the image that differs from x only by changing 
the value of x in position j to the other binary value, i.e., xi = 1 — zj. This 
change is accepted by the algorithm, i.e., x is replaced by 2’, if C(a’) < C(x). 
Even if the objective functional does not get smaller, the change is accepted 
with a probability depending on the difference AC = C(x’) — C(x). 

Formally, the change is accepted even in that case when 


exp(- AC/KT) >z, (14.5) 

where &, T, and z are, respectively, the Boltzmann constant (11.3805 x 10773 x 
m?kg s ? K -!), current temperature, and a randomly generated number from 
a uniform distribution in the interval [0, 1]. Otherwise, the change is rejected, 
i.e., z does not change in this iteration step. If a change is rejected, then we 
test the level of efficiency of changes in the image in the last iterations. It 
means that we count the number of rejections in the last Mi iterations. If 
this number is greater than a given threshold value Rtnr, then the efficiency 
of changes is too low and the SA optimization algorithm will be terminated. 
We calculate the variance of the cost function in the last N,4, iterations. 

A so-called equilibrium state is said to be attained if the present estimate of 
the current AC variance is greater than the previous variance estimate. If the 
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Fig. 14.1. Flowchart of the implemented SA algorithm. 


equilibrium state is achieved, we reduce the current temperature (allowing 
changes with smaller probabilities when the value of the objective functional 
is greater) and let the algorithm run with a lower temperature value (T is 
replaced by h- T, where h is the cooling factor). In our experiments, we chose 
the same value for this parameter as in [12], namely ^ — 0.9. 

Finally, we implemented the SA as follows. We change all the x-vector ele- 
ments from 0 to 1 or vice versa in one iteration step. The changes are accepted 
or rejected as it is described. When one iteration has been done, we reduce the 
current temperature and start a new iteration step. The algorithm stops when 
the ratio of the value of the current objective function and that of the starting 
objective function fall below a given threshold value (C(z')/C(x() < Cinr) 
or the current temperature is less then a given temperature (T < Ti). 
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14.2.3 The Pixel-based Reconstruction Method 


Since industrial objects are often made of more than two materials, it was 
desirable to enable the technique to reconstruct not only binary but multi- 
level images as well. In this case, the pixels can take their intensity values from 
D = {di,..., dı}, i.e., | > 2 denotes the number of possible intensity values 
(levels). The obvious extension of the binary modification rule to multilevel 
image is a uniformly distributed random choice of an intensity level from D. 
In order to improve the efficiency, we add to (14.4) a regularization term 
(f) that includes the a priori information about the image function to be 
reconstructed. So the objective that we used in our pixel-based method is 


C(f) 2 M IRA O) — Poll? +y 6), (14.6) 
oe 


where y > 0 is the so-called regularization parameter, which determines the 
weight between the two terms. A small y prefers a result that matches better 
the input projections, a big y yields an outcome that is more appropriate for 
the a priori information. If the image to be reconstructed contains a few large 
regions made of homogeneous materials, this information may be modeled by 
smoothness, provided by the functional 


exp ( ees ) 


of) =—D DF (@—uy-2) — Áo, 0 (14.7) 


x,y U,v 


where 0}, 02, H1, and u2 are suitable constants. 


14.2.4 The Parameter-based Method 


Our second reconstruction method takes the assumption that the object is 
composed of a tube encompassing a solid cylinder called the interior (i.e., the 
inner space of the tube), which contains a known number of disjoint solid 
spheres or cylinders made of homogeneous materials. Furthermore, it is also 
assumed that at most four different homogeneous materials constitute the 
object, namely, 


(a) the material of the tube, 

(b) the material of the interior, 

(c) the material of the spheres and cylinders, and 

(d) the background surrounding the object, which is usually air or vacuum. 


Since spheres as well as cylinders can be described by a few parameters 
like center, height, and radius, each object can be represented as a vector 
of parameters called a configuration. In order to perform a truly 3D recon- 
struction, the optimization of the objective functional C(f) is performed in 
the parameter space iteratively. Starting off from an initial configuration, the 
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current configuration is altered at every iteration step to produce a better 
approximation of the object to be reconstructed. The modified configuration 
may be accepted only if it satisfies certain geometric restrictions. A more 
detailed description of the algorithm can be found in [7, 10]. 


14.2.5 Simulation Studies 
Reconstruction of Pixel-based Objects 


We reconstructed several multilevel phantom objects. One of the phantoms 
and one of its projections can be seen in Fig. 14.2(a) and (b). The reconstruc- 
tions using the objective functionals (14.4) and (14.6) with the regulariza- 
tion term (14.7) are shown in Fig. 14.2(c) and (d), respectively. Clearly, the 
smoothness term improved the quality by preferring images having connected 
regions with constant values. 





(d) 


Fig. 14.2. The three-level phantom image and its reconstructions. (a) The phantom 
image used for simulation studies. (b) One of the projections of the object in (a). 
Reconstruction results from 12 noiseless projections, 400 measurements/projection 
(c) using the objective functional (14.4), and (d) using the objective functional of 
(14.6) and (14.7). 


Reconstruction of Parameter-based Objects 


'The aim of this simulation experiment was to examine the effects of geomet- 
ric complexity of the 3D object when using only two noisy projections. The 
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object was a tube containing spheres with different diameters. The complex- 
ity of the object was determined by the number of spheres in the tube. As 
Fig. 14.3 shows, it is, generally, hard to produce an acceptable result from 
two projections if there are five or more spheres. The objects are presented 
using the virtual reality modeling language (VRML97 [19]) and reconstructed 
by DIRECT [17]. 


# of Original Result Difference 
sph. 





Fig. 14.3. Reconstruction by parameter-based method using different numbers 
of spheres (parameters: 10% noise, two 2D projections, 100 x 100 measure- 
ments/projection). First column: number of spheres. Second column: original ob- 
jects. Third column: reconstructed objects. Fourth column: differences between the 
reconstructed and original objects (only mismatching voxels are painted). 


The influence of the noise can be observed in the reconstructed images in 
Fig. 14.4. 


14.2.6 Experimental Results 


We had the opportunity to test our techniques using X-ray projection data 
of real physical phantoms. One of these physical experiments is presented in 
this subsection. 


Test Object 


The object was a reference cylinder; a solid cylinder made of plexiglas, contain- 
ing three cylindrical bores of different diameters and depths in an asymmetric 
arrangement [see Fig. 14.5(a)]. The lower part of the deepest hole was filled 
with aluminium screws, as shown in Fig. 14.5(b). 
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Noise Original Result Difference 
(%) 








Fig. 14.4. Reconstructions by the parameter-based method from noise-free and 
noisy projections (parameters: three spheres, two 2D projections, 100 x 100 mea- 
surements/projection). First column: noise level. Second column: original object. 
Third column: reconstructed objects. Fourth column: differences between the recon- 
structed and the original object. 














(b) 


Fig. 14.5. Reference cylinder. (a) Structure of the reference cylinder used in the 
X-ray experiments. (b) One of its projections. 
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Data Acquisition 


The apparatus was suitable to collect not only X-rays but also other kinds of 
rays (neutron or gamma) as well (see Fig. 14.6). The object to be investigated 
is placed on a table rotated by a stepper motor, thus letting the beams trans- 
mit through the object in different directions. The beams attenuated by the 
object impact into a scintillator, which transforms the detected radiation into 
visible light detected by a CCD camera. Since the camera can be damaged 
by direct exposure to radiation, an optical mirror system conveys the light 
from the scintillator to the CCD camera. The images taken by the camera are 
stored temporarily by the camera controller, and finally a dedicated PC reads 
out the raw image data from this storage. A more detailed description of the 
imaging apparatus can be found in [1]. 
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Fig. 14.6. Imaging apparatus for collecting projections. 


Preprocessing 


Due to several distorting effects of the data acquisition system, the measured 
projections of the physical objects were not suitable for immediate reconstruc- 
tion. In order to diminish the effects of these distortions, some corrections 
(preprocessing steps) were performed on the projection data. In the case of 
the reference cylinder, these steps were the following: 


(a) Logarithmic transformation of the measured data according to (14.3). This 
step is necessary to get the line integrals of the absorption function to be 
reconstructed. 
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(b) Due to the fluctuating flux of the rays and/or the changing sensitivity 
of the imaging system in time, the average intensity of the projections 
varied from projection to projection. In order to eliminate this effect, we 
performed intensity correction by multiplying the projections by suitable 
constants. 

(c) A further distorting effect was caused by white points randomly dis- 
tributed in the projections. To reduce their effects, we applied a thresh- 
olded median filtering. 


Further details about the correction steps and their effects on the recon- 
struction can be found in [10]. 


Determination of Intensity Levels 


The next problem to be solved before reconstruction was that the exact inten- 
sity levels of the cross-section images (i.e., the exact absorption values) were 
unknown. Anyway, in real physical experiments, only approximative absorp- 
tion values can be given, since only average absorption coefficients could be 
used for polychromatic X-rays. This fact violated one of our basic assump- 
tions, namely, the absorption coefficients of the few materials making up the 
object should have been known exactly. So we had to find a technique to 
approximate the right absorption values. 

In the case of the parameter-based method, the absorption values could 
be calculated in virtue of the object geometry. In the pixel-based case, our 
idea was that we approached the image f of the right (but unknown) values 
with another image f’ having more given values approximating the right ones 
from both directions. As a first approximation of the right intensity values, we 
could take the local maxima in the histogram of f’. The set of the increased 
number of intensity levels of f^ was produced by the equidistant division of 
the interval of the possible levels of f. For example, if the intensity levels 
are from the interval [0,1], and 41 intensity levels are picked from that, the 
increased set of levels is {0,1/40,2/40,...,1}. 

In the case of the reference cylinder, even the background was subtracted, 
because it was very noisy, and it degraded the result. Despite this step, the 
image still remained a three-level one (having a background value near 0). 
A reconstruction result, which was done by using 41 intensity levels from 18 
projections, can be seen in Fig. 14.7. The obtained intensity levels can be 
considered only as an approximation of the right ones. As a following step, we 
took the local maxima of the histogram of this reconstruction result as the 
intensity levels that could be used for the multilevel DT reconstruction. In 
Fig. 14.8, the histogram of Fig. 14.7 is presented. It can be seen that there are 
three peeks in the histograms (see pointers in Fig. 14.8). The corresponding 
intensity levels were used in the multilevel DT reconstruction. 
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Fig. 14.7. A reconstruction result (155 x 155) of the pixel-based method using 41 
intensity levels. 
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Fig. 14.8. Histogram of the reconstructed cross section in Fig. 14.7. 


Pixel-based Reconstruction Results 


With the preprocessed projections and intensity values described so far, 
we performed the pixel-based reconstruction. Its result may be seen in 
Fig. 14.9(a). For the sake of comparison, we performed ART reconstruction 
(1000 iterations, relaxation parameter 0.001), which yielded the outcome vis- 
ible in Fig. 14.9(b). 


Parameter-based Reconstruction Results 


In addition to the software experiments mentioned in Subsection 14.2.5, we 
had the opportunity to try the parameter-based reconstruction method on 
physically measured data too. In the experiments, we used the reference cylin- 
der introduced in Subsection 14.2.6. [One of its X-ray projections is shown in 
Fig. 14.10(a).] Because of the assumption that every cylindrical hole is filled 
with the same material, the lower half of the projections had to be discarded 
[see Fig. 14.10(b)]. The model reconstructed from four projections and the dif- 
ference of the original and the reconstructed model are seen in Fig. 14.10(c) - 
(f), respectively. For a more detailed description of these physical experiments, 
see [7, 10, 11]. 
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Fig. 14.9. Reconstruction results. (a) A pixel-based reconstruction result of the 
cross section denoted in Fig. 14.5(b) using three intensity levels. (b) An ART re- 
construction yielded by SNARK93 [18]. 
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Fig. 14.10. Projection and reconstruction results of the Plexiglas object given in 
Fig. 14.5. (a) One of the projection images of the whole object. (b) One of the pro- 
jection images of the upper half of the object. (c) 0° view of the reconstructed model. 
(d) 90° view of the reconstructed model. (e) Top-down view of the reconstructed 
model. (f) Difference between the reconstructed and original models. 
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14.3 Reconstruction from Limited View Angles 


The scope of this section is to point out a further potential of DT for NDT 
with X-ray CT. A key problem for X-ray CT in NDT is the strong absorption 
in metal objects. We aim to increase the inspection size of oblong objects that 
are made of a single material and for which X-ray penetration is limited for a 
few directions [see Fig. 14.11(a)]. This is a special case of limited view angle 
tomography. It is clear that filtered back-projection (FBP) is not suitable as 
a reconstruction method because it requires projection data from many view 
angles on a circular source trajectory, i.e., usually a 180? scan for parallel beam 
geometry and a 360? scan for fan beam geometry. If we use FBP despite the 
lack of data, limited view angle artifacts will arise. A compensation is possible 
with DT using the preknowledge about the material composition of the object. 
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Fig. 14.11. (a) Special case of limited view angle tomography for oblong objects. 
(b) Illustration of one projection equation: It is a projection along a single X-ray 
onto one detector pixel. The image pixels that contribute to the projection are drawn 
shaded. 


Nevertheless, FBP is the most popular technique in NDT because of its 
high image quality, fast implementation, and robustness against distortions in 
the data, e.g., noise, polychromatic X-rays, and scattering. The ultimate goal 
of this work is the reconstruction of experimental data with DT for which 
high data quality is crucial. Before starting with the experiments, we simu- 
lated projection data and analyzed the sensitivity of DT to distortions. For the 
experiment itself, we compared a micro-focus CT system with a flat-panel im- 
age (FPI) detector at Siemens to a special line detector set up at the EMPA.! 
All investigations were performed with 2D tomography and a special phantom 
that features relevant NDT tasks like crack and void detection as well as di- 
mensional measuring. A beam-hardening correction was also performed. The 


1 Eidgenössische Materialprüfungs- und Forschungsanstalt, Dübendorf, Switzer- 
land. 
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simulations and experiments were performed by Siemens. The reconstructions 
using discrete tomography were done at the University of Szeged. 

Let us consider again the optimization task given in (14.6) using the 
smoothing regularization term (14.7) and the optimization method simulated 
annealing described in Subsection 14.2.2. 


14.3.1 Experimental Issues 
Investigated Object: Bat Phantom 


We designed a phantom that is a model for a number of nondestructive testing 
problems. It is made of copper (strong absorber) with gaps (crack detection), 
drills (void detection), curved and linear shapes (dimensional measuring). Fig- 
ure 14.12 shows a photo of the manufactured phantom. This “bat phantom” 
has a cross-sectional size of 72mm x 16mm, and the smallest drill diameter 
and gap height are 0.5 mm. The phantom length of 70 mm is a critical size for 
X-ray penetration in copper at 200kV. This size was defined with the help 
of simulated data for different conditions (namely, object size, voltage, and 
dynamic range). 


Fig. 14.12. Photo of the investigated bat phantom (cross section 70mm x 16 mm). 


Selecting an Appropriate Experimental Set-up to Avoid Scattering 
and to Achieve a High Dynamic Range 


Experimental measurements were done at two different setups for CT. The 
first one is a 225 kV micro-focus system with a 2D FPI detector at Siemens 
CT PS9. The other is a 450 kV mini-focus X-ray tube with a 1D line detector 
at the EMPA. The principal configuration of the two systems is sketched in 
Fig. 14.13. 

'The 225 kV micro-focus system is designed for high-resolution 3D cone 
beam computed tomography of small parts. The object is usually magnified 
by a large factor, and the spatial resolution is defined by the focal spot size 
of the X-ray tube (typ. Ø 1-100 um). At the same time, the tube power is 
limited (typ. « 100 W), and the acquisition time for one projection is rather 
large. A 2D area detector is necessary to achieve reasonable scanning times 
in 3D micro-focus CT. The advantages of our FPI detector are a high spatial 
resolution (in our case, 2048 x 2048 pixels of 0.2-mm pixel size) and good 
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noise properties. A disadvantage is that strong internal scattering affects the 
image quality if large and strong absorbing objects are investigated at higher 
energies, e.g. the bat phantom at 200 kV. This is a commonly known problem 
for FPI detectors in NDT [15], and the usable dynamic range (maximum 
detectable intensity/minimum detectable intensity) is significantly reduced 
for such experiments. Scattering from the object plays a minor role for this 
set-up because the object-detector distance is quite large (0.5-1 m). 





Fig. 14.13. Geometric configuration of the investigated CT systems. (a) Micro- 
focus CT with FPI detector and (b) mini-focus CT with line detector. 


The 450 kV mini-focus system uses 2D fan beam geometry and a line 
detector whose pixels are spaced in equal angular increments (Fig. 14.14). It is 
designed for the inspection of larger parts with a lower resolution. To enhance 
penetration possibilities, voltages up to 450 kV can be used. A high tube power 
(1 kW) allows reasonably fast scanning, but the focal spot size is large (Ø 
2.5 mm). If the investigated object is not magnified (or a small magnification 
is used), the spatial resolution will be determined by the detector pixel size. 
As the detector pixel spacing (2.1 mm) is large and the number of detector 
channels (125) is small, subpositions are scanned to achieve a resolution of 0.1— 
1mm. For our experiments, an acquisition mode has been used that yielded 
parallel beam projection data? with 0.23-mm resolution. The most important 
advantage of the EMPA system is its special line detector that avoids X-ray 
scattering. It has a tungsten collimator to suppress scattering coming from 
the object (important because of a small object-detector distance). Moreover, 
it uses large and strong absorbing scintillator cells (6-mm CdWO,) such that 
almost no radiation can pass into the detector housing and cause internal 
scattering. Therefore, the data are not distorted, which is a prerequisite for 


? This is possible by a translational movement of the object parallel to the detector, 
rotation by the fan angle and resorting of the projection rays. 
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DT reconstruction. Furthermore, the usable dynamic range and the possible 
material thickness for inspection are much larger as compared to the FPI 
detector. 


P" Scintillator 6mm CdWO, 


e. 
450kV 
Mini-focus 
X-ray Source 


Fan Beam mp 2... 


Tungsten Collmater 
(Colkmation within and also 





perpendicular to the mage plane) 


Fig. 14.14. Schematic of EMPA set-up. Scattering is avoided by a special line 
detector. 


In order to study the effects of polychromatic X-ray and its correction, a 
special phantom object was designed and used in the experiments. The details 
of these experiments are given in [8]. 


14.3.2 Results 
Simulation Results 


To analyze the performance of discrete tomography, simulated projections 
have been used. We started with ideal monochromatic data and then investi- 
gated how noise and polychromatic X-rays affected the reconstruction. Scat- 
tering has not been considered in the simulations. The simulation was done for 
ideal fan beam geometry, i.e., a point source and a linear detector (400 detector 
pixels, 360 projections in 360°, image pixel size for reconstruction 0.32 mm). 
The detected intensities were computed by line integrals and the exponential 
absorption law. Using floating-point arithmetic, the dynamic range was infi- 
nite at first, i.e., no views with limited number of digits occurred. In order 
to produce views with a limited number of digits, we reduced the dynamic 
range by setting all values below a certain threshold to this minimum value. 
For all monochromatic examples, the energy was 160 keV. For the polychro- 
matic case, a linearly approximated tube spectrum at 160 kV was used with 
an additional beam-hardening filter of 1.5-mm copper. This approximation is 
sufficient here because the spectrum is not too different from a real one and 
we look at the problem from a qualitative point of view. 
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The reconstructions were performed with filtered back-projection (FBP) 
and discrete tomography (i.e., reconstruction of pixel-based objects). The ex- 
ecution of FBP needed 5 sec on a 2 GHz CPU. This compares to 10 min for 
one DT reconstruction (10° iterations). Furthermore, the DT reconstructions 
were repeated 50 times and averaged to give a better and more stable result 
(repeated reconstructions do not give identical results with DT thanks to the 
stochastic behavior of SA). The total reconstruction time for 50 repetitions 
was 8hr. As an additional feature, a smoothness penalty term has been used. 
Different settings will be mentioned explicitly. 


Simulation for the Ideal Case 


The first result we show is for the ideal case, i.e., monochromatic data, no 
noise, and infinite dynamic range. In Fig. 14.15, the FBP reconstruction for a 
high sampling with 800 detector pixels and 2880 projections is shown. No arti- 
facts are visible in this image, and we can take it as a reference for comparison 
with the other results. To point out deviations between different reconstruc- 
tions, we will frequently use vertical line profiles across the most critical region 
of the bat phantom as indicated by the arrow. 

For the investigation with DT, we used a low sampling of 400 detector 
pixels and 360 projections. This optimized the reconstruction time, and for 
DT it is expected to produce no errors (DT is also suitable to reconstruct 
objects from a very small number of projections). The reconstructions from 
low sampling with FBP and DT are comparable on Figs. 14.16(a) and (b). A 
closer look at the vertical line profiles [Fig. 14.16(c) and (d)] reveals ripples 
(aliasing artifacts) for FBP due to the low sampling, but the geometric shape 
does not change. The profiles are compared to FBP with high sampling, for 
which only some minor spikes remain (the sampling is still not high enough). 
The DT result is exact within the pixel resolution. This result was computed 
without a smoothness penalty term as it did not change anything here. Note 
that the line profiles for the low sampling were resampled to match the high 
sampling case. 


Fig. 14.15. FBP reconstruction for the ideal case with high sampling. The arrow 
indicates the position where line profiles will be taken. 
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Fig. 14.16. (a) FBP and (b) DT (no smoothness penalty term) reconstruction 
for the ideal case (with low sampling). The corresponding vertical line profiles are 
shown in (c) and (d). They are compared to FBP with high sampling. 


Simulation with Reduced Dynamic Range 


'To simulate a real detector, we reduced the dynamic range to 8.5 bits, which 
was found in first experiments. Figure 14.17(a) shows the formation of limited 
view angle artifacts for the FBP reconstruction. The number of artifacts is 
quite large here, and the problem can be seen as difficult to solve. For discrete 
tomography, we tested reconstructions with different options (see the details in 
[8]). In this case, a solution could be found only with the help of a smoothness 
penalty term [Fig. 14.17(b)]. The implementation of the smoothness penalty 
term was a major success of our work. It is suitable not only to suppress 
branches but also reduces the sensitivity to other distortions, for example, to 
noise. The overall stability of the algorithm was significantly enhanced. 





(a) (b) 


Fig. 14.17. Reconstructions for 8.5-bit reduced dynamic range. (a) FBP, (b) DT 
(50 repetitions, with smoothness penalty term). 
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Comparing the vertical line profile of this result (with smoothness penalty 
term) to the ideal case with infinite dynamic range [Fig. 14.18(a)], we observe 
some remaining deviations of the object shape. Here the limits for reduced 
data can be seen. The maximum penetration length for X-rays in copper at 
160 keV and 8.5-bit dynamic range is 32 mm. This is also the maximum 
object length for which FBP reconstruction is exact. Considering that the 
investigated object length was 70mm, the DT result is quite impressive. This 
proves that DT is suitable to increase the possible object length for inspection. 


Vertical line profiles across reconstructed bat phantom Vertical line profiles across reconstructed bat phantom 
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Fig. 14.18. Vertical line profiles for DT reconstructions. (a) 8.5-bit reduced dynamic 
range compared to infinite dynamic range. (b) 296 noise compared to no noise for 
8.5-bit reduced dynamic range. 
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Simulation with Noise and Reduced Dynamic Range 


In Fig. 14.19, we show the impact of 2% (relative to the free ray intensity) 
quantum noise that was added to the projection data in the case of 8.5- 
bit reduced dynamic range. An impression for the amount of noise can be 
obtained from the FBP reconstruction [Fig. 14.19(a)]. The DT reconstruction 
[Fig. 14.19(b)] is almost unchanged compared to the case without noise. This 
is clearly visible in the vertical line profile [Fig. 14.18(b)]. Only for larger 
noise levels, e.g., 596, we observed deformations and distortions of the object 


(a) (b) 


Fig. 14.19. (a) FBP and (b) DT reconstruction from noisy data (296 noise added 
to the case of 8.5-bit reduced dynamic range). 
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(not shown). The stability against moderate noise is an important capability 
considering experimental data. An explanation may be that the distortions 
are statistically distributed and the correct object is still the best fit to the 
projection data. 


Experimental Results 


From the simulation studies we found that a beam-hardening correction was 
necessary for good reconstructions with DT. Furthermore, we can expect that 
scattering is also very critical because it is a systematic distortion as poly- 
chromatic X-rays are. The measurements were taken with two different set-ups 
(Subsection 14.3.1). One was at Siemens (micro-focus X-ray tube and FPI de- 
tector) and the other at the EMPA (mini-focus X-ray tube and special line 
detector). If not explicitly mentioned, we used 200 kV and a 1.5-mm Cu filter 
for our experiments. The acquired data were down-sampled for reconstruc- 
tion: EMPA (parallel beam geometry, 330 detector pixels, 312 projections in 
180°, image pixel size for reconstruction 0.23mm) and Siemens (fan beam 
geometry, 400 detector pixels, 360 projections in 360°, image pixel size for re- 
construction 0.23 mm). The DT reconstructions were performed as described 
for the simulation part. 


Reconstruction Results from Experimental Data 


First we will show the reconstruction results for the FPI detector. From the 
discussion above, we expect a strong impact because of the massive scattering 
effects. For FBP [Fig. 14.20(a)] we observe huge limited view angle artifacts 
and a strong smearing in the reconstructed image. With continuously increas- 
ing distortions, the quality of the FBP result decreases continuously. This 
kind of stability is a major strength of FBP. The behavior of DT is different, 
however. Due to the large amount of distortions, the reconstruction with DT 
is not possible any longer [Fig. 14.20(b)]. The object “breaks up” at some 
level of distortions. A beam-hardening correction was not feasible here. 


a) b) 


Fig. 14.20. Attempt to reconstruct data from FPI detector (200 kV) with (a) FBP 
and (b) DT. 


For the EMPA line detector, the data quality is much better. With FBP 
reconstruction of the raw data, we observed only very small limited view angle 
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artifacts (not shown). To make the problem a bit more challenging, we reduced 
the dynamic range in a preprocessing step to 10 bits. In a second preprocessing 
step, the beam-hardening correction, which was described in [8], was carried 
out. The FBP result is shown in Fig. 14.21(a). Now the reconstruction with 
DT is also possible [Fig. 14.21(b)]. Up to this point, all experiments were 
performed at 200 kV. Another way to solve the problem is to increase the 
voltage. At 450 kV the limited view angle artifacts in FBP disappeared almost 
completely (Fig. 14.22). 


py 


(a) (b) 


Fig. 14.21. (a) FBP and (b) DT reconstruction with beam-hardening correction 
for line detector (200kV). 





Fig. 14.22. FBP reconstruction with beam-hardening correction for line detector 
(450 kV, high sampling). 


For a precise analysis, we compared the vertical line profiles of the DT 
result at 200kV to the FBP result at 450kV (Fig. 14.23). The positions of the 
profiles relative to the object were the same as in the simulation part and were 
indicated in Fig. 14.16. Considering the shape of the reconstructed object, we 
found that DT (200kV) and FBP (450kV) are in good agreement. A closer 
look reveals that the FBP case is not perfect, because the X-ray penetration is 
still not fully possible and some artifacts remain in the reconstructed image. 
The upper center bar is brighter in the image, and the corresponding first 
peak in the vertical line profile is widened a little bit. The geometric shape 
for the DT reconstruction appears to be quite good. The full width at half 
maximum of the peaks in the vertical profile is 2mm for the first peak and 
2.1mm for the second peak as compared to 2.1mm in the original object. As 
the spatial resolution was only 0.23mm, the accuracy was not very high at 
this stage. This might also be the reason why the smallest drill hole appears 
slightly too small here. The final result clearly shows that DT can be applied in 
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experiments with high data quality to increase the possible inspection size of 
oblong objects. To quantize the improvement, the object size (70mm) can be 
compared to the penetration limit (37 mm). But FBP may also be suitable to 
reconstruct objects that are a bit larger than 37 mm, because small artifacts 
can be tolerated. We estimate that the improvement in object size of D'T 
compared to FBP was above 50%. 


Vertical line profiles across reconstructed bat phantom 
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Fig. 14.23. Plot for line detector: comparison of vertical line profiles for DT 
(200 kV) and FBP (450 kV, high sampling) reconstructions. 


14.3.3 Discussion and Further Research 


We implemented two methods. The pixel-based one seems to be promising 
for the reconstruction of industrial objects (made of a few materials) from a 
small number of projections. It gave good results even in nonideal, for exam- 
ple, in noisy circumstances using a priori information. Since the absorption 
coefficients were not known exactly we implemented a technique to retrieve 
their approximative values. 

If the object to be reconstructed is known to be mathematically repre- 
sentable by some parametric functions (e.g., simple geometric primitives like 
spheres or cylinders), the usage of such a priori information may greatly im- 
prove the quality of the reconstructed image, even when fewer projections are 
available. This is demonstrated by the fact that the parameter-based recon- 
struction method shows high robustness even if the projections are degraded 
by 4096 of the additive noise. 

It is also clear that the parameter-based algorithm is inapplicable if the 
object cannot be described in a simple geometric way, or when the materials 
of which the specimen is made are not homogeneous. On the other hand, the 
pixel-based method is unaffected by these circumstances, so it can again be 
employed successfully. Furthermore, the pixel-based algorithm may benefit 
from using more intensity levels, whereas the parameter-based one is limited 
to four absorption values only. 
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Though the techniques presented here gave promising and acceptable re- 
sults, we are planning to improve their effectiveness in the future. It would be 
desirable, for both methods, to speed up the optimization procedure and to 
test them on more physically measured projections. The pixel-based technique 
should be made less sensitive to noise by incorporating a noise model. Finally, 
some practical extensions of the parameter-based algorithm would be to al- 
low for more than four materials and to use more complex prior knowledge 
(model) of the object. 

We proved that DT is suitable to increase the possible object size of ob- 
long, single-material objects. The dimensions for the bat phantom were chosen 
according to similar NDT problems that are difficult to solve. The limits for 
DT reconstruction were analyzed via simulations. For the best results, the 
implementation of a smoothness penalty term was an important step. Adding 
moderate noise of 2% did not change the result, but polychromatic X-rays 
caused deviations in the object’s shape. We concluded that a beam-hardening 
correction is necessary for experimental data. In the experimental part, we 
investigated a micro-focus CT system with a FPI detector at Siemens and a 
mini-focus CT system with a special line detector at the EMPA. The EMPA 
line detector very efficiently avoids scattering and acquires high-quality data, 
while the data of the FPI detector are strongly distorted by scattering for this 
type of experiment (large and strong absorber at high energy). As expected, 
DT reconstruction was not possible for the FPI detector data. In contrast to 
this, the EMPA system delivered good results. The improvement in object 
size, compared to FBP, was estimated to be above 50%. 


14.4 Proposed Combination of CAD Data and DT 


There are many problems in technological applications of tomography, such 
as the examination of turbine blades for geometrical perfection and for ob- 
struction either by remaining wax after the casting process or by coking after 
extended use. The detection of such a third material only makes sense using 
neutron radiography, where metals can be well penetrated while keeping a 
high sensitivity for hydrogenous materials such as wax and coking, which is 
not possible with X-rays. For measurement of deviations of the ideal shape, 
either radiation can be employed. 

High-quality neutron computed tomography is available at Paul-Scherrer- 
Institute, Switzerland, at Technische Universitat München, Germany [2], and 
also at NIST, USA. 


14.4.1 Turbine Blades and Machine Parts 


Modern turbine blades are hollow and contain channels for gas cooling by the 
centrifugal force. Gas (air) enters the blade at the bottom, flows through a 
zigzag channel, and is drained through small-diameter bores at the trailing 
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edge of the fin. This internal cooling allows the blade to run at an external gas 
temperature well above the melting point of its alloy, which enables the turbine 
to run with higher efficiency. Figure 14.24 shows the neutron radiography of 
a large turbine blade with cooling channels and particles used for internal 
cleaning after production, which had not been completely removed. 








Fig. 14.24. Neutron radiography of a large turbine blade with cooling channels and 
particles used for internal cleaning after production. Images courtesy of PSI. 


For small turbine blades, the complete bulk can be well penetrated, and a 
complete reconstruction is possible. Figure 14.25 shows a section of a neutron 
computed tomography of a small turbine blade of about 4-cm diameter. The 
blade contains several large-diameter cooling channels and several rows of 
small-diameter draining bores for the cooling gas. 

Due to the half-moon cross section of the turbine blades, the inner chord 
is never accessible for tangential rays. All rays penetrating this area travel a 
long distance through solid material, and for large turbine blades, they are 
attenuated close to zero. This leads to a fuzzy reconstruction in that area, 
whether neutrons or high-energy X-rays are used for imaging. 

Figure 14.26 presents filtered back-projection reconstructions from simu- 
lated ideal X-ray projection data of a phantom at 200kV. In Fig. 14.26(a), the 
dynamic range was infinite, while it was reduced to 7 bits in Fig. 14.26(b). 
The latter case is an approximation for a real X-ray tomography experiment. 
Fuzzy reconstruction appears both on the inner chord of the shape and also 
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Fig. 14.25. Neutron computed tomography of a small turbine blade of about 4-cm 
diameter with large-diameter cooling channels and small-diameter draining bores. 


on the long, straight outer section, where rays have to penetrate a long path 
of bulk material. 

Figure 14.27 shows a stroboscopic neutron radiography of a combustion 
engine running at 1000 rpm, measured by a joint research team PSI, TU 
München, Uni Heidelberg, and ILL at the Institut Laue-Langevin, France 
[13]. A sensor on the cam shaft is used to trigger the camera at a defined 
position of the pistons. For the first time, the oil cooling of the pistons was 
made visible: An oil jet is ejected from below to the bottom of the piston, 
lowering its temperature by more than 200? during operation. The oil jet and 
the oil dome at the piston bottom are clearly visible. Standard back-projection 
tomography is not possible, as the engine cannot be penetrated along the large 
dimension parallel to the crank shaft. 


14.4.2 Proposed DT Research Project 


'The examples shown above demonstrate some very real technical problems 
with high possible merit, since large turbine blades are manufactured as single 
crystals and may cost more than 100,000 Euros each, so the development of 
this new method may prove to be very profitable. 

Both in the case of the turbine blades and in the case of the combustion 
engine, the ideal shape of the parts is available in the form of original design 
CAD data in the computer, and manufacturing is usually accurate within 
small fractions of a millimeter. 

Every modern CAD software can output the original CAD data in the 
format of an STL triangular surface mesh. At the software company Volume 
Graphics [16], a software tool is available to generate a voxel data set out 
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(a) (b) 


Fig. 14.26. Reconstruction of a phantom from simulated ideal data: (a) infinite 
dynamic range and (b) 7 bits. Fuzzy reconstruction appears where rays have to 
penetrate a long path of bulk material. 


of STL data to any scale and resolution, so the ideal reconstruction of the 
turbine blade (and in the long run, of the engine) can be calculated from the 
original design data. 

We propose to use this perfect reconstruction data set as a priori informa- 
tion in order to detect any deviation from this ideal shape, caused either by 
production defects (only two materials existent), or by obstructions by wax or 
coking, or lubricant in the case of the engine, which would mean a third ma- 
terial in the data set. Apart from detecting the third material, the main task 
of DT would be to enable tomographic reconstruction either from projections 
with limited angular range when some angles are not accessible or when the 
shape of the sample causes insufficient data for some angles. 

In the latter case, both neutrons and X-rays can be employed, but the 
detection of the third material is especially feasible for neutron tomography. 
TU München proposes to take over the physics part in a joint research project 
together with Volume Graphics and an institute of mathematics to be defined. 

We are aware that there is still a long way to go until the very recent 
method of discrete tomography can be applied to these actual physical prob- 
lems, but we also believe that the possible merit, both scientific and economic, 
will be very high and is well worth the effort of a joint research project. 
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Fig. 14.27. Stroboscopic neutron radiography of a combustion engine running at 
1000 rpm. An oil jet is ejected from below to the bottom of the piston. The oil jet 
and the oil dome at the piston bottom are clearly visible. 
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Emission Discrete Tomography 


E. Barcucci, A. Frosini, A. Kuba, A. Nagy, S. Rinaldi, M. Samal, S. Zopf 


Summary. Three problems of emission discrete tomography (EDT) are presented. 
'The first problem is the reconstruction of measurable plane sets from two absorbed 
projections. It is shown that Lorentz theorems can be generalized to this case. The 
second is the reconstruction of binary matrices from their absorbed row and columns 
sums if the absorption coefficient is po = log((1+/5) /2). It is proved that the recon- 
struction in this case can be done in polynomial time. Finally, a possible application 
of EDT in single photon emission computed tomography (SPECT) is presented: 
Dynamic structures are reconstructed after factor analysis. 


15.1 Introduction 


In the classical physical model of transmission tomography, some rays are 
emitted from a source and transmitted through the object to be imaged. The 
intensity of the transmitted rays is measured, and these data are the input for 
reconstructing the object. In this chapter, we deal with an alternative physical 
model used in emission tomography, where we measure rays emitted from the 
object itself. In our special case, we suppose that all points of the object emit 
rays into all directions of the space with unit intensity. It is also supposed that 
the whole space is filled with some homogeneous material having known ab- 
sorption. The projections of the ray-emitting object are acquired by detectors 
placed outside the object. We are interested in the reconstruction of objects 
from such absorbed projections. 

Emission discrete tomography (EDT) is when the object to be recon- 
structed is represented by a function having a known, discrete range. Many 
results of EDT concern reconstruction of finite 2D subsets of points hav- 
ing nonnegative integer coordinates, called discrete sets. These sets can also 
be represented by binary matrices. In the special case when the absorption 
coefficient is log((/5 + 1)/2) and the horizontal and vertical absorbed pro- 
jections are given, there is a necessary and sufficient condition of unique- 
ness [14, 15, 16], and reconstruction methods [2, 13] have been published. It 
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has been proved [3] that two opposite absorbed projections determine a dis- 
crete set uniquely. A general algebraic characterization of discrete sets being 
nonuniquely determined from a finite number of given absorbed projections 
is shown in [7]. There are results also in the more general class of measurable 
plane sets. It is proved that Lorentz theorems can be extended also in the case 
of absorbed projections [12]. 

In this chapter, we discuss three EDT problems: a pure theoretic, an algo- 
rithmic, and an application one. The very first problem in Section 15.2 is the 
generalization of Lorentz theorems when the two projections depend not on 
a constant absorption coefficient, but on some given separable functions, and 
these functions can be different in different directions. The next problem to 
be discussed in Section 15.3 is to reconstruct binary matrices from absorbed 
row and column sums if the absorption coefficient is log((V/5 + 1)/2). The al- 
gorithm shows that this reconstruction problem can be solved in polynomial 
time (cf. [2]). Finally, we show a possible application of EDT in medical imag- 
ing in Section 15.4. In nuclear medicine, there are imaging devices [e.g., single 
photon emission tomography (SPECT) cameras with two or more heads] to 
collect sequences of images simultaneously from a small number of projec- 
tions. That is, a 4D image (three spatial and one temporal dimensions) can 
be reconstructed from such data. The base of discrete tomography in this case 
is that before reconstruction the image sequences can be expressed as linear 
combinations of the projections of so-called factor images (the factors can be 
computed by factor analysis). Here the factors are 3D homogeneous objects 
to be reconstructed from their projections by some EDT method. The results 
of the first simulation experiences are given in the last part of Section 15.4. 


15.2 Generalization of Lorentz Theorems 


The reconstruction of measurable plane sets from their projections is a classi- 
cal problem with several applications. The projections are usually defined as 
the integral values of the characteristic function of the set along straight lines 
in different directions. One of the most frequently studied cases is that of the 
horizontal and vertical projections, i.e., the line integrals along horizontal and 
vertical directions are given. The measurable plane sets can be classified with 
respect to the projections as uniquely and nonuniquely determined ones. A set 
is uniquely determined or, shortly, unique if there is no essentially different set 
with the same horizontal and vertical projections. Otherwise, the set is said 
to be nonuniquely determined or, shortly, nonunique. Even function-pairs can 
be classified from the viewpoint of reconstruction as unique, nonunique, and 
inconsistent ones. A pair of functions is unique/nonunique if they are essen- 
tially the horizontal and vertical projections of a unique/nonunique plane set. 
It is also possible that there is no measurable plane set having projections es- 
sentially the same as the function-pair; in this case, the function-pair is called 
inconsistent. 
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The very basic result related to such classification of measurable plane sets 
is due to Lorentz, who published necessary and sufficient conditions on unique- 
ness/nonuniqueness/inconsistency of function-pairs in 1949 [19]. He also gave 
a characterization of the uniquely determined sets. Other results about the 
reconstruction of plane sets appear in [9, 17, 18]. 

A new kind of projection can be defined in the case of emission computed 
tomography, where the projections of the set also contain the effect of the 
constant absorbing space. This absorption problem is studied in the case of 
discrete plane sets [4, 7, 14]. There is also a result [12] that similar theorems 
can be proved for absorbed projections as for nonabsorbed projections in the 
case of measurable plane sets. 

Let us suppose that the object to be reconstructed is given by a measur- 
able subset in the first quadrant of the plane. The absorption in the plane is 
represented by a known constant u. The detectors are placed in the points of 
the positive parts of the z-, respectively, y-axis so that they measure the activ- 
ities along the vertical, respectively, horizontal half-lines of the first quadrant 
of the plane. 

We are going to show that the EDT defined above can be transformed 
into the classical transmission discrete tomography (TDT), where the sets are 
to be reconstructed from nonabsorbed projections. In this way, the problems 
of EDT can be reformulated as TDT problems. For example, uniqueness and 
existence in EDT can be reduced to the case solved by Lorentz theorems, 
and the uniquely determined plane sets in ED'T can be reconstructed and 
characterized in a similar way to TDT. 


15.2.1 Preliminaries 
Projections without Absorption 


Let F be a measurable plane set of finite measure, i.e., A2(F) < oo (Ag is the 
Lebesgue measure on the plane). Let x denote the characteristic function of 
FCR?. 

The functions 


oo 


[PxF](y) = / xr(z,y)dz = fx(y) (15.1) 
and v 
[Py F(x) = J xr(s,y)dy = fe(z) (15.2) 


are said to be the (nonabsorbed and nongeneralized) horizontal and vertical 
projections of F, respectively (see Fig. 15.1). Two measurable plane sets F 
and F” of finite measure are said tomographically equivalent (with respect to 
their projections), mathematically denoted by F ~ F", if 
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Fig. 15.1. The plane set F, its horizontal (fx), vertical (fy), and second projec- 
tions. 


PxF = PxF' and PyF = PyF’ a.e., (15.3) 


where “a.e.” abbreviates “almost everywhere.” 
The following problems are studied in connection with the reconstruction 
of measurable plane sets. 


UNIQUENESS U». 

Given: A measurable plane set F. 

Question: Does there exist a measurable plane set F’ essentially differ- 
ent from F such that F and F' are tomographically equiva- 
lent with respect to their (nonabsorbed and nongeneralized) 
horizontal and vertical projections? 


CONSISTENCY (or EXISTENCE) C3. 
Given: Two functions, p and q : IR — R+. 
Question: Does there exist a measurable plane set F such that 


[PxFl(y) = a(y) and [PyF\(x) = p(x) xe. (15.4) 


RECONSTRUCTION Rg. 
Given: Two functions, p and q : R — R+. 
Task: Construct a measurable plane set F such that 
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[PxF](y) = a(y) and [PyF\(2) = p(x) ac. (15.5) 


Considering the plane sets 


ty) |0 < æ < [PxF](y)} (15.6) 
and 

{(z,y) | 0 < y < [Py F](x)} (15.7) 
and taking their projections, we get the second projections of F, formally, 


oo 


55e J E E 2] dy (15.8) 


— oo 


and 
oo 


ju es J xix | fe(z) > y} de (15.9) 


— oo 


(see Fig. 15.1). 
For the third projections of F, we have 


oo 


petu J xie | fxv(z) > y) de (15.10) 


— oo 


and 
oo 


fossis J E > a} dy. (15.11) 


— oo 


'Then, using this notation, the Lorentz theorem has the following form. 


Theorem 1. Let fx(y) and fy(x) be nonnegative integrable functions such 
that 


[toa f sonas (15.12) 


(i) There exists an F C R? measurable plane set having these functions as its 
horizontal and vertical projections if, and only if, 


c c 


[tx dx > ] f dr for allc>0. (15.13) 
0 0 
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(ii) There exists a uniquely determined F C R? measurable plane set having 
these functions as its horizontal and vertical projections if, and only if, 


[ix dz = | 9 dz forallc>0. (15.14) 
0 0 


(iii) The functions fx and fy are inconsistent, i.e., no plane set F exists 
with these functions as its (nonabsorbed and nongeneralized) projection 
functions, if, and only if, 


[fx dz < J m dz for some c >Q. (15.15) 
0 0 


15.2.2 Reconstruction of Measurable Sets from Two Generalized 
Projections 


Our goal in this subsection is to show that the definition of projections can 
be even more general and that Lorentz theorems are still valid. Emission 
computed tomography will be considered again, i.e., all the points of the set 
that is to be reconstructed are emitting unit intensity rays in all directions. 
The whole plane is supposed to modify these rays—following different laws in 
different directions. We will show that the sets, and also the function-pairs in 
case of generalized modification, can be characterized in a similar way as it 
was done in the classical case or with a homogeneous p-absorption [27]. 


Generalized Projections 


Consider a measurable plane set G C R2 with finite measure. Suppose that 
the intensity of a ray being transmitted through set G changes by 


f= h: f os). (15.16) 


where Jp is the ray’s initial intensity and w, is a strictly positive function 
defined on R2. that additionally depends on the ray's direction r. In the fol- 
lowing, we are interested in the horizontal and vertical directions only, i.e., 
r — (1,0) and r — (0,1), respectively. Thus, for the horizontal and vertical 
projections, we have 


P(^()- f, xclsisu) ua cy) ae (15.17) 
0 


and 
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PY) (x) = i: E aec in idus (15.18) 
0 


We suppose that w(1,9) and wi,1) are the products of two strictly positive 
integrable functions each, i.e., 


w(,oy(r, y) = o(z) Bly) and won, y) = q(x): (y); (15.19) 


where a, f, y, and 6 are strictly positive functions defined on R+}. We define 
the generalized first projections of a measurable plane set G C R2. by 


[Pe gy) = J xa(x, y) - o(2) - B(y) dz (15.20) 

0 

and " 
(Po) a) = J xale;y) - (2) - 8(y) dy: (15.21) 


0 


We also suppose that these projections (integrals) exist almost everywhere. 
Two measurable plane sets G and G” are said to be tomographically equiv- 
alent with respect to their generalized projections if 


peg = Pq and PPG = PL a’ ae, (15.22) 


shortly, G ^" (o, B, y,0) G'. 
The following problems are to be investigated again. 


UNIQUENESS UG». 

Given: A measurable plane set G C R2. 

Question: Does there exist a measurable plane set G' C RÊ essentially 
different from G such that G and G' are tomographically 
equivalent with respect to their generalized horizontal and 
vertical projections? 


CONSISTENCY (or EXISTENCE) CGo. 
Given: Two functions, p and q : Ry — R+. 
Question: Does there exist a measurable plane set G C RŽ such that 


[PY Gly) = au) and [PP G](z) = pa) ae. 
(15.23) 


RECONSTRUCTION RG2. 
Given: Two functions, p and q : R} — R+. 
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Task: Construct a measurable plane set G C R2 such that 


[PP Gly) = ay) and [PPP G(x) = p(x) ae. 
(15.24) 


15.2.3 Transformation 


Similarly to the p-absorption, let us define two transformations of the plane 
into itself and the composition of these transformations. 
Let Tx and Ty be defined by 


Tx(z,y) = (ta(x),y), where talx) = [ow dé, «>0, (15.25) 
0 


and 
Ty(#,y) = (a,te(y)), where ts(y) = I rai y>0, (1536) 
0 


and for the composition T = Tx o Ty = Ty o Tx, we have 


T(x,y) = (ta(a), ts(y))- (15.27) 


Since a and 6 are supposed to be strictly positive functions, Tx and Ty, 
and, thus, T are injective transformations of the plane R2 into itself. This can 
be shown with the help of the following lemma (the proof is omitted here). 


Lemma 1. Let £ be an everywhere strictly positive integrable function R —> 
R4. For each E measurable subset of R, which has a positive measure, the 
integral of € over E is positive, formally, 


fo dz > 0. (15.28) 
E 
Lemma 2. Let £ be an everywhere strictly positive integrable function R —> 
R+. The integral function of e, 


x 


E(x) = (EG dé, (15.29) 
0 


is an injective function R — R. 


The transformation T plays a very important role in the following. It gives 
the possibility to reduce the reconstruction problem of generalized projections 
to the classical reconstruction problem in such a way that we consider the 
measurable plane set 

F = TG. (15.30) 
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Generalized First Projections 


We define the following two functions from which the original generalized 
projections can be calculated by 


© y) = PLE) = J xc(z, y) - a(z) de (15.31) 

0 

and n 
st? (x) = [PË G]() = ] xe - ó(y) dy . (15.32) 


0 
'Then for the generalized projections, we have 


[PS Gly) = By) PGy) ana. [PÀG] E) = (a) - Pre (2). 
15.33 
For any measurable sets J C R and J CR, we know that 


ee d= fias and prow ay= [ tay, (15.34) 


tol Jd. ts J 


in other forms, 


[ut a(x) dz = [eat dx and fat :ó(y) dy = doigt dy. 
0 0 0 0 


(15.35) 
Using all these facts, we can calculate the generalized first projections of 
G: 


u) = [Pay = / xa(2, y) - (ae) de = J xr c(s, y) de 
= [PxTxGly) =[PxTCltsly)) 
= fx(ts(y)) (15.36) 
and 
g(a) = [P9 aya) = P xc(s,y) - &(y) dy = J xryelt,y) dy 
= [PyTyGl(«)=[PyTGl(ta(e)) —— 
= fy(ta(z)). (15.37) 


This shows that the generalized first projections of G can be calculated from 
the (nonabsorbed and nongeneralized) projections of the transformed sets 
TxG and Ty G and from those of the transformed set F = TG. Note that the 
functions 8 and y do not play a role when we calculate these equalities and 
transformations. 


342 E. Barcucci, et al. 
Generalized Second Projections 


For the generalized second projections of the plane set G C R2, we do not 
take into account the G- and y-functions. These projections are defined by 


se» = PPP s.v) a y) = 2} (15.38) 
and . : 
IP = PU?) y) of (2) = y) (15.39) 


Let us calculate the generalized second projections, 


LI = PLE yg (y) » x) = PyTV( y) lg (y) > x) 
= PyTy{(2,y) | fx (ts(y)) > 2} = Pvt v) | fx(y) > a) 
[xv (15.40) 


and, similarly, 


IS = PLE, y) | 9 (a) > y) = Px Tx (xy) lg (a) > y) 
= PxTx{ (x,y) | fy (to(2)) > y) = Px y) fv(z) > y) 
= frx. (15.41) 


From these equations we see that the generalized second projections of a set 
G are identical to the second (nonabsorbed and nongeneralized) projections 
of the transformed set F = TG. 


Generalized Third Projections 


The generalized third projections are defined by 


Ix = Px (oy) | gS? (x) = y} (15.42) 
and : : 
ey = Py (Gy) o (y) = a}, (15.43) 
and, thus, we get 
EÈ = Px{(a,y) | 9S? (v) > y) = Px{(a,y) | fxy (a) 2 v) 
= fxyx (15.44) 
and 
(a) — p (6) 0) sel p s 
gyxy = Py{(x,y)|gyx (Y) > z} = Py{(x,y)| fvx(y) 2 x) 


= fyxy. (15.45) 
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We are going to prove a theorem about the existence and uniqueness of a 
measurable set G for given generalized projection functions similar to the 
cases without absorption and with p-absorption. 


Theorem 2. Let a(x) > 0, B(y) > 0, y(x) > 0, and ó(y) > 0 ae. be four 


strictly positive functions Ry — Ry. Given gS? (y) and 99? (x), two non- 


negative integrable functions, such that 


œ (a,b) È (7,8) 
Ix (y). s gy CNN x 
SHUT ó(y)dy — og (x) da. (15.46) 


(i) There exists a G C R2. measurable plane set of finite measure having these 
functions as its horizontal and vertical generalized projections if, and only 


if, 


c c 


[Peo dz > [io dz, forall c> 0. (15.47) 
0 0 


(ii) There exists a uniquely determined G C R2 measurable plane set of finite 
measure having these functions as its horizontal and vertical generalized 
projections if, and only if, 


c c 


[Po dr = [io dz, forall c >Q. (15.48) 
0 0 


(iii) The functions ge and qo» are inconsistent, i.e., no plane set G exists 
with these functions as its generalized projection functions, if, and only if, 


c c 


[Po dz < [io dz, for some c > 0. (15.49) 
0 0 


Proof. Consider the plane transformation 


x 


T(z,y) = Ga(2),65) = ( i a(é) d£, | &(n) an). (15.50) 
0 


0 


Let us define the functions fx and fy by 


(2,8) (7:5) ( 
frx(ta(y)) = 2 and fr ta) = 2, (15.51) 


These functions are defined in tR, and taR+, respectively. Let us define 
them to be 0 out of these sets. Since t, and ts are injective (see Lemma 2), 
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these definitions make sense. From the definitions of fx and fy and from the 
assumption, we know that 


* 7? (esf) *? 9. *: 
fikoa = £x yy cay = [Saw ae = f eG) ae. 
0 0 0 


(15.52) 
We are going to use Theorem 1 in the proof of the three cases. 


(i) There exists a measurable plane set F' with (nonabsorbed and non- 
generalized) given projections fx and fy if, and only if, the condition 
fo [xv (z)dz > fy fvxv(x) dz is satisfied, for all c > 0. Consider (15.40) 


and (15.45). If there exists a set G with the generalized projections gc 


and gt 8) then fx and fy are the (nonabsorbed and nongeneralized) pro- 
jections of F = TG and thus, the inequality holds. If the inequality holds, 
there exists a set F C R? with the (nonabsorbed and nongeneralized) pro- 
jections fx and fy. The set FTIR? has the same (nonabsorbed and 
nongeneralized) projections (almost everywhere) since the projections are 
0 outside tsR4 and t4IR,, respectively. Thus, G = T~!(F N TR?) is a 
measurable set with the generalized projections go ) and ge 8), 

(ii) The set F is unique if, and only if, [> fxy (£) dr = fy fyxy(x) da, for 
all c > 0. Since T is an injective transformation of the plane into itself 
and F n (TTR2)* has measure 0, the set G is also unique with respect to 
its generalized projections if, and only if, fj gxv (x) dx = fy gv xv (x) da, 
for all c > 0. (M* denotes the complement of the set M.) 

(iii) No set F exists if, and only if, there exists a c > 0 with ie fxy(a) du < 
IN Ív xv (x) dz. As a similar consequence, there is no set G with the gen- 
eralized projections gS’ ) and of 8) if, and only if, there exists a c > 0 


with Jo gxy (x) dz < Ju gv xv (v) dz. 














An Example for Generalized Absorption 


Let us assume a material with anisotropic absorption. This means that dif- 
ferent absorption coefficients have to be taken into account in the z- and 
y-directions. A ray's intensity will change by the equation 


I=]: [ee dr, (15.53) 
G 
where u = (Hz, fly) is the vector of the absorption coefficients. Calculating 


the intensities along the z- and y-directions, we get 


PP (y) = Jer dx and PLY) (gy = Jpm dy (15.54) 
G G 
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or, equivalently, 


PE (y) > nm .e "7" dy and PE (s) = J xe e My dy. 
0 0 


(15.55) 
Setting a(x) = e "**, B(y) = 1, y(x) = 1, and ó(y) = e "v*, we get an 
example of the generalized case. The ray intensities in each direction change 
by the same law as they do in case of the jj-absorption, but for given functions 
we will get different sets having these functions as generalized projections. 
Applying the transformation 
jette qe cuv 
T (a, y) = (—~—_, —_—) (15.56) 
Ha: Hy 
on the plane, we can reduce the problem of generalized projections to the 
classical (nongeneralized) case and vice versa. 


Uniqueness in the Case of Generalized Projections 
As a consequence of Theorem 2, we get directly 


Theorem 3. Let a, B, y, and 6 be four strictly positive, integrable functions 
R — R,. Let G C R? be a measurable plane set having a finite measure. G 
is uniquely determined by its generalized projections gen? and gree if and 
only if, 
g(x) = gH, (x), for alla ER. (15.57) 
Remark 1. It is easy to see that Theorem 3 is true also if (15.57) is replaced 
by 
6 6 
xy) = Krl), for ally ER. (15.58) 
Theorem 2 also indicates that whenever a measurable plane set G is unique 
in the generalized case, its transformed set T'G is also unique in the non- 
generalized case and vice versa: Whenever a measurable set F C TR? is 
unique with respect to its (nonabsorbed and nongeneralized) projections, the 
set G = T-1F is unique with respect to its generalized projections pee G 


and pong 'The transformation T' is defined by 


T(s,y) = ( J a(é) d£, / (n) d). (15.59) 
0 0 


Finally, le& us mention that a characterization of the measurable plane 
sets, which are unique with respect to their generalized projections, is given 
in [27]. 
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15.3 An Algorithm for Reconstructing Binary Matrices 
from Absorbed Row and Column Sums 


In this section, we move from the general class of characteristic functions to 
the special class of binary matrices. We consider a reconstruction problem 
when the absorption coefficient is just y = jo = log((1 + V5)/2) and the 
projections are the absorbed row and column sums. We are going to prove that 
this reconstruction problem can be solved in polynomial time and even such an 
algorithm is presented here. Let us remark that a polynomial reconstruction 
algorithm is published also in the case when the two absorbed projections are 
the opposite row/column projections of the binary matrix to be reconstructed 
[4]. 


15.3.1 Definitions and Notation 


Using the notations of [15], let A = (aij) x» be the binary matrix to be 
reconstructed (i.e., aj; € {0,1} for i = 1,2,...,m, j = 1,2,...,n). In order to 
make the notation simple, let G9 = e^» = (1 + V/5)/2. Then the (uo-absorbed) 
horizontal and vertical projections of A are defined as 


Rg(A)— R— (ri...Tm) where m= M ayb (15.60) 
j=l 
and 
Cg,(A) =C = (c1,...,Cn), where cj— X ayb ; (15.61) 
i=l 


respectively, for each 1 € i X m and 1 € j € n. 
The selected £o has the following important property: 


BÉ = Bo AS. (15.62) 


Equation (15.62) means that if three consecutive elements in a row of a binary 
matrix are 100, then they can be changed (switched) to 011 (or vice versa) 
without changing the absorbed row sum. Of course, an analogue statement 
is true also for columns. This switching property is the base of the whole 
uniqueness problem. A switching theory for this scenario has already been 
provided in [15, 16]; we only recall some useful basic definitions and results. 

A binary word w = a1 ...@n, with n EN, is called a finite n-length repre- 
sentation in base Bo (briefly, a Bo-representation) of r € R if 





r—81: By. +a: Bg? as Bg, (15.63) 


while r is said to be the value of w. A Go-representation is a jg- expansion if 
it does not contain any 011 subword. Naturally, a number r can have several 
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n-length @o-representations, but there is only one fo-expansion of length n. 
As an example, the following words are /jo-representations of length 11 of the 
same number: 


w;—0101101000 0, 
w2=01011001100, 
w3 =0110000110 0, 
wa = 10000010000, 


(15.64) 


but only the last one is a Go-expansion. 

Using the concept of 8o-representation, we can say that if r; and cj are 
the ith row sum and jth column sum, respectively, of an m x n binary matrix 
A (1 € i € m and 1 € j € n), then the words created from the binary 
digits of the binary matrix A in row i, i.e., aj10;2...aG;, and in column J, 
i.e., 415025 ... à, are n-length and m-length Go-representations of r; and cj, 
respectively. 

It is easy to see on the base of (15.62) that not only 100 can be replaced 
by 011, but also any subsequence 1(0x)*00 by O(1x)*11 (k > 0), and vice 
versa, in a fo-representation without changing the value of the represented 
number (x stands in the positions of binary digits, which do not change their 
values during the replacement). We call such a replacement a 1D switching 
and denote it by 1(0x)*00 — 0(12)^11. Specially, the simplest 1D switching, 
100 — 011, is called (1D) elementary switching. 

It is easy to see that each 1D switching can be obtained by the consecutive 
application of 1D elementary switchings. As an example, the 1D switching 


1010100 e 0111111 (15.65) 
can be obtained by the application of three 1D elementary switchings: 
1010100 — 1010011 — 1001111 — 0111111. (15.66) 


More generally, it is also true [16] that any n-length o-representation of a 
number can be obtained from any other n-length Óo-representation of the 
same number by consecutive applications of 1D switchings. 


15.3.2 Reconstruction from 4o-Absorbed Row and Column Sums 


Now, we consider the reconstruction problem in the case when the uo-absorbed 
row and column sums of the binary matrix are given. 


RECONSTRUCTION RA. 
Given: Two nonnegative real vectors, R and C. 


Task: Construct a binary matrix A such that its ug-absorbed row 
and column sums, i.e., R(A) and C(A), are R and C, respec- 
tively. 
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Before giving a reconstruction algorithm, we extend the concept of 1D 
switching to that of 2D switching, i.e., a transformation that changes some el- 
ements of the binary matrix A, without modifying its horizontal and vertical 
uo-absorbed projections. A complete study of 2D switchings from a tomo- 
graphical point of view can be found in [15, 16]. 


2D Switching 
Using the notation of [15], let 
Sij = t= 1,i,i +1} x {j 5343441) (15.67) 


be the 3 x 3 discrete square set of positions of A such that 1 < i < m 
and 1 < j < n. Two squares Sij and Sj; are said to be side-connected if 
either i = i' and |j — j'| 2 2, or |i — i'| 2 2 and j = 7’. A collection X of 
discrete squares is side-connected if each couple of its elements Sij and Sj; 
is connected by a side chain, i.e., by a sequence of elements of X starting 
with Sij, ending with Sjj, and such that any two consecutive elements of 
the sequence are side-connected. The side-connected set X is strongly side- 
connected if, whenever two of its elements intersect, then either they are side- 
connected or they have a common side-connected neighbor. By definition, a 
single square is a side-connected set. Examples of not side-connected, side- 
connected, and strongly side-connected sets are given in Fig. 15.2(a)-(c). 































































































a) b) c) 


Fig. 15.2. Examples of not side-connected, side-connected, and strongly side- 
connected sets of discrete 3 x 3 squares. The squares indicated by their borders 
are the elements of the sets. (a) A not side-connected set. (The highlighted squares 
are not side-connected with any other squares of the set.) (b) A side-connected 
set, which is not strongly side connected. (The highlighted squares have a common 
position, but they are not side-connected.) (c) A strongly side-connected set. 


Lemma 3. The elements of a (strongly) side-connected set X! can be re- 
arranged in X' so that each prefix of X" is a (strongly) side-connected set 
itself. 
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Proof. Let X = {5S}, S2,..., Sk} be a (strongly) side-connected set. We re- 
arrange X in the sequence X’ = (51,55,..., Sp} so that $1 = Sı, and, for 
each 1 < i < k, the element S; is the first element of X not yet belonging to X” 
and is (strongly) side-connected with at least one of the elements of X". The 
existence of such a X" can be proved indirectly. Let us assume that there exists 
an index | (1 < | € k) such that none of the elements of XXV (51, S5,..., Sj) is 
(strongly) side-connected with any element of (51, 55,..., S1). Then there is 
no side chain between any two elements of these sets, which is a contradiction. 














From now on, each time when we deal with (strongly) side-connected sets, 
we consider the sets rearranged according to Lemma 3. 

We define the pattern as a binary-valued function defined on an arbitrary 
subset of the positions of A. The domain of the pattern P is denoted by 
dom(P). Furthermore, we say that a pattern P contains a pattern P’, denoted 
as P' C P,if P is the extension of P' according to the usual analytical 
meaning. 

Let the two patterns 


(0) _ ics 
EDs and | EQ = (15.68) 


er © 
oor 
oor 
oOrF 
=.. o 
=.. o 


be defined on the discrete square $;;. It is easy to check that the substitution 


of E ) with ED ina binary matrix or vice versa is a 2D switching, i.e., it does 
not change the jig-absorbed horizontal and vertical projections. Furthermore, 
such a switching is minimal with respect to the number of elements involved. 
We refer to the patterns EQ and EY as 0-type and 1-type 2D elementary 
switching patterns, respectively. 
The composition of patterns P and P’ is denoted by © and is defined on 
the set 
dom(P © P’) = dom(P) A dom(P’) (15.69) 


(A denotes the symmetric difference) by 


poren (Ree Ream om 


Figure 15.3 shows the composition of two patterns P and P' having nondis- 
joint domains. 
Two patterns, P and P', are strongly linked in a pattern P" if 


dom(P) Udom(P’) C dom(P", POoP'CP' PZP", and P'g P". 

(15.71) 
It is important to remark that if P and P' are strongly linked in P", then 
dom(P) N dom(P^) # 0. Figure 15.4 shows two different cases. In the first 
case, the patterns P and P' of Fig. 15.3 are strongly linked in the pattern 
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1011| 01 

1:9:0 1]. 0:1 

00; : : 0011 
90:0). : [01:0 
1:1 uim 0:0 0 
Ooo oT 011 
100011 0:0:0:0 

P P P@P’ 
Fig. 15.3. The composition of the patterns P and P’. The symbol “—” indicates 


the positions of undefined elements in dom(P) U dom(P’). 










































































1:09.11 1|[0 1 1:0 1]0 1]0 1 
1:00 0 1/0 0 1090/01/00 
0:000 01 1 0.0/0j0 011 
0:0/0:0]0:1:0 00/0 0/0:1.0 
1:10 00 0 0 11/0 0|0 00 
0000001 0000]|11 1 
1:0:0 0:1 1:0 1 0:000 0:0 
a) b) 


Fig. 15.4. The patterns P and P’ in Fig. 15.3 are strongly linked in pattern (a), 
while they are not in pattern (b). 


of Fig. 15.4(a). The same patterns are not strongly linked in the pattern of 
Fig. 15.4(b), because it contains the highlighted pattern P’. 

In the sequel, we indicate with p : E,..., Ex the sequence of the 2D ele- 
mentary switching patterns Ej,..., Ey of the same type, whose domains form 
a strongly side-connected set. Furthermore, let One Ej denote the successive 
compositions of the elements of p, formally, 


k 
(QE -[-.[[Ex © E © E © -+ © Ex). (15.72) 


(Ol E; = E, by definition.) As an example of the successive compositions 
of BY), EQ, ED, 49, and EY) ,; see Fig. 15.5. 
It is easy to check the following property. 


Property 1. Let Ej; and E> be two elementary switching patterns, strongly 
linked in a pattern P. The sequence of the three consecutive elements of the 
pattern P in positions dom( £4) ndom( £3) is different from both 100 and 011. 


We now define the general notion of the 2D composite switching pattern, 
which plays a central role in the algorithm for solving RECONSTRUCTION 
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1:0:0 1:0 —0 0 (1:0|7|0:0 
011 au —un 01|—1:1 
——— ——|0]1.1 '1.0/0/—0|1 1 
01:1 0:1 1 01-11 
01:1 011 0 1|—1:1 

a) b) c) 


Fig. 15.5. The steps (a), (b), and (c) giving the successive composition of EO 


1) ? 
1 1 1 
EU aH ss ud E us 


RA. Given a sequence p : Ej,..., Ex, the 2D composite switching pattern E 
is defined so that 


(i) dom(E) = UX, dom(£;), 

(ii) Or, E: C E, 

(iii) there exists a sequence p! : E1,..., Ej, whose elements are a permutation 
of those of p such that Oe E; and E; are strongly linked in E for each 
1 «iX k (see Fig. 15.6). 


Any 2D elementary switching component is also a 2D composite switching 
component by definition (k — 1). The elements of sequence p are said to be 
the components of the composite switching pattern E, which also inherits 
its type from its components. Figure 15.6 shows two patterns E' and E" 
containing the composition of the elements of the sequence 

PUEDE SEN EE sess (15.73) 


ij o Pi j+ i23 





At first sight, it may seem that both E' and E" do not satisfy condition (iii) 
of the definition of the elementary switching pattern. But if we consider the 
sequence 


p : pops 


(1) (1) 
Ei mf (15.74) 
we notice that it is a rearrangement of the elements of p preserving the side 
connectedness and satisfying condition (iii), as desired. Hence E' is a compos- 
ite switching pattern, while no such rearrangement can be found for E". 


Lemma 4. A 2D composite switching pattern does not contain any other 2D 
composite switching patterns. 


Proof. Let us assume that E and F’ are two 2D composite switching patterns 
whose components are the elements of the sequences p and p’, respectively, 
such that E' C E. It is easy to see that E and E' have the same type and 
p' is a subsequence of p. By definition, there are E; € p X p' and E; € p’, 
which are strongly linked in E. Since E’ does not contain E;, the sequence of 
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1 0/0/00 1 0000 
01/011 01/011 
E’: |1:0|1/1 0 E": |1:0|1/0 0 
01/0/11 01/011 
0 1/01 1 0 1/0/11 
































Fig. 15.6. Pattern E' is a 2D composite switching pattern, while E" does not satisfy 
condition (iii) of the definition. The highlighted elements indicate the positions for 
which the 2D elementary switching components containing these positions are not 
strongly linked in E' and E". 


the three consecutive positions of dom(£;)M dom(£;) have values 100 or 011 
in F’. At the same time, by Property 1, the same consecutive positions have 
values being different from 100 and 011 in E, which is a contradiction. 











In [15], it is proved that a 2D composite switching pattern is defined up 
to the order of its components. Furthermore, it has the following remarkable 
result. 





Theorem 4. Let E and E’ be two 2D composite switching patterns obtained 
from the two sequences 


p: FO... BO. and p: B®. ,..., BY (15.75) 


5,71? 1th Ik 41,31? 1th Ik? 


respectively. If E coincides with E' in dom(E \ O E then the sub- 
stitution of E with E' or vice versa in any binary matriz does not change 
the uig-absorbed horizontal and vertical projections of the matriz, i.e., it is a 


switching operator. 


'The switching operator defined in Theorem 4 is also called composite 
switching, while the pair (E, E") is called composite switching pair. The com- 
posite switching is the most general switching as it is shown in [15] by the 
following result. 


'Theorem 5. Each 2D switching can be obtained by the successive application 
of composite switchings. 


Given a composite switching pattern E, two complementary sets of posi- 
tions can be defined on its domain: the core, which contains the positions being 
all elements of its 8-neighborhood in dom(F), and the boundary containing 
the rest of the positions of dom(£). 


Lemma 5. Two 2D composite switching patterns contained in a binary matrix 
can intersect each other only along their boundaries. 


Proof. This immediately follows from Lemma 4, since if two 2D composite 
switching patterns share some positions in their cores, then they also share 
some 3 x 3 elementary switching pattern and, consequently, they contain the 
same 2D elementary switching pattern. 
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If there is no absorption, then the reconstruction algorithms usually do not 
deal with the search of possible 2D switchings. The reason is that the (unab- 
sorbed) 2D switchings in a binary matrix can constitute a very complicated 
structure and it cannot be handled simultaneously by a polynomial-time pro- 
cess. On the contrary, in the case of absorbed projections, as it is shown, the 
domain of a switching becomes a connected area, and this property provides 
the possibility of detecting all switchings in a binary matrix easily and recon- 
structing it efficiently by handling all its possible ways of growing, without 
slipping into nonpolynomial computations [2]. 

It is clear that the j;jp-absorbed row and column sums of an m x n bi- 
nary matrix are nonnegative real numbers having n-length and m-length, 
respectively, Bo-representations. Let us call such vectors compatible. So, the 
m x n binary matrix is reconstructed from compatible vectors R and C, entry 
by entry, in a process that exhaustively searches for all possible elementary 
switching pairs. Once one of them is supposed to be found, the algorithm tries 
to follow its evolution by means of two different lines of computation (only 
two lines are sufficient by Lemma 4). Finally, if no inconsistencies are found, 
the two patterns are fixed in A, and their core positions will not be changed 
by the computation any more. 

The algorithm relies on two binary m x n matrices ROW and COL, which 
are initialized with the Bo-expansions of the elements of R and C, respectively. 
During the process, they are updated according to the part of the matrix that 
has been already reconstructed, and they eventually can be duplicated if a 
switching pair is detected. 


REC2DswITCH. 
Input: Vectors R = (ri,...,r4,) and C = (c1,...,¢n) with nonnega- 
tive real numbers. 
Output: A binary matrix A, whose horizontal and vertical ug-absorbed 


projections are R and C, respectively, if such a matrix exists, 
else failure. 


Step 1: Create the m x n binary matrices ROW, COL, and A such 
that 
- the ith row of ROW is the fo-expansion of ri (i = 
1:2; iym); 
- the jth column of COL is the fo-expansion of cj (j = 
1215-0), 
- all the elements of A are undefined. 
Step 2: for l1<j<nand1<i<m, 


if (i, j) is not a position in the core of an already detected 2D 
composite switching pattern, then 
2.1 if ROW |i, j| = 0 and COLL[i, j] = 0, then set A[;, j| = 0; 
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2.2 if ROW [i, j| = 1 and COL{i, j| = 0, then set Ali, j] = 0 
and perform the shortest 1D switching in the ith row of 
ROW starting from position (i, j). If there is no such 1D 
switching, then halt and failure; 

2.3 if ROW [i, j| = 0 and COL{i, j| = 1, then set A[i, j] = 0 
and perform the shortest 1D switching in the jth column 
of COL starting from position (i, j). If there is no such 
1D switching, then halt and failure; 

2.4 if ROW [i,j| = 1 and COL[i,j] = 1, then HUNT- 
SWITCH(i, j); 

Step 3: Give the matrix A as output. 


Now, we briefly sketch the subprocedure mentioned in Step 2.4, hunting 
for elementary switching pairs: 


HUNT-SWITCH (i, j). 
Step A: Create the local matrices ROW’, ROW", COL’, and COL” 


initialized as 


ROW' = ROW" = ROW and COĽ = COL" = COL. 
(15.76) 
Step B: We are going to check the possibility of two composite switch- 
ing patterns E©) and E® in A forming a switching pair. 
Their components will be denoted by p and p(), respec- 
tively. Let i' =i+1, and j’ 2 j 4 1. 
Step B.A: Assume that Es belongs to p% . Try to create 


EUR in ROW' and COL’ by performing the shortest 
1D switchings in the rows from i’ — 1 to i’ +1 of ROW' 
and in the columns from j' — 1 to j' +1 of COL’. If 
such switchings are not possible (no E©) in A), then set 
Ali, j| = 1 and exit the subprocedure; 

Step B.B: Assume that E s belongs to p). Try to create 


EC) ,, in ROW" and COL" by performing the shortest 
1D switchings in the rows from i’ — 1 to i’ +1 of ROW" 
and in the columns from j' — 1 to j' -- 1 of COL". If 
such switchings are not possible (no E(? in A), then set 
Ali, j| = 0 and exit the subprocedure; 

Step B.C: Check the boundary of EA [respectively, of 


EQ 4] for other 0-type [respectively, 1-type] 3 x 3 el- 
ementary switching patterns, which are strongly linked 
with it, and which have to be inserted in p? [respectively, 
pO. For each elementary switching pattern, apply Steps 
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B.A and B.B recursively, after uploading i’ and j’ to the 
central position of the elementary switching pattern; 
Step C: Copy those entries of ROW’ whose positions belong to 
dom(E)) in the corresponding positions of A and update the 
entries of ROW and COL with those of ROW' and COL’, 
then exit the subprocedure. 


Remark 2. In Step C of HuNT-SwITcuH(i, j), the choice of updating the ma- 
trices ROW and COL with the entries of ROW’ and COL’ instead of with 
those of ROW" and COL” is arbitrary. However, once the choice has been 
made, it has to be maintained all over the reconstruction process in order to 
prevent inconsistencies that may arise when two composite switching patterns 
intersect. 


The proof of the correctness of REC2DSWITCH is immediate, since it per- 
forms an exhaustive search of all the composite switching pairs contained 
inside each solution. 


Theorem 6. Let R and C be compatible vectors. Algorithm REC2DSWITCH 
reconstructs a binary matrix having the uig-absorbed row and column sums R 
and C, respectively, if such a matrix exists; otherwise, it returns with failure. 


With the following theorem, it is proved that the reconstruction prob- 
lem RECONSTRUCTION RA can be solved in polynomial time with algorithm 
REC2DswITCH. 


Theorem 7. The problem RECONSTRUCTION RA can be solved in polynomial 
time with respect to the dimensions m and n. 


Proof. The complexity of REC2DSWITCH is computed from the complexities 
of each of its steps. 


Step 1: The matrices ROW, COL, and A are created in O(mn) time. 

Step 2: It requires at most mn calls of its substeps. The complexities of 
Step 2.1 and Step 2.2 are constant, while Step 2.3 involves the call of 
the procedure HUNT-SWITCH. This procedure simply scans and compares 
part of the positions of the matrices ROW’, ROW”, COL’ and COL", 
in O(mn) time. 

Step 3: It requires again O(mn) time. 














So, the total time complexity of the procedure is O(m?n?). 
Let us conclude this section with a remark and an example. 


Remark 3. Algorithm REC2DswrrCH detects all the switching pairs present 
inside each of its solutions, so it allows us to solve the related uniqueness 
problem also. 
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Example 1. As an example, we show how algorithm REC2DSWITCH recon- 
structs a solution when the input vectors are 


R= (B5. B5. +807 c 853, Bg + 859, B. + PB BS + B) 


and 





(15.77) 





C= (B5? + 80°, B5 + B? +o, Bo + Bg, Bo 1 bi^ Bo + Bo * s 2 


(15.78) 
In Step 1 the matrices 
1000000 1111011 
1110000 0100000 
ROW —1000010, COL=0100010, (15.79) 
1000100 0000100 
1010000 0010000 


and A are created. 

Since ROW[1,1] 2 COL[1,1] = 1, Step 2.4 calls Hunt-Switch(1, 1), which 
returns failure, because it is not able to find a switching pattern in this posi- 
tion. Then Step 2.4 proceeds in two parallel computations, setting the entry 
in position (1,1) both to the value 0 and to the value 1. Since the process for 
the value 1 fails in position (1,2), then A[1,1] = 0 and ROW and COL are 
updated with the performed 1D switchings, i.e., ROW[1,2] = ROW[1, 3] = 1, 
and COL[2,1] = COL[3, 1] = 1. 

In position (2, 1), similar steps are performed and A[2, 1] is set to 1 with 
no further changes in ROW and COL. 

The analysis of position (3,1) starts and HUNT-SwITCH (3,1) is called. 
The entries in dom(E{?)»)) in ROW” and COL’ have to be modified with a 
series of 1D switchings involving rows 3, 4, and 5 of ROW’ and columns 1, 2, 
and 3 of COL’. The process is successfully completed and then it is applied to 
ROW" and COL” as well. At this stage, the four matrices are the following: 


0110000 0101011 0110000 0111011 
1110000 1110000 1110000 1100000 
0110010, 0110010, 1000010, 1000010, 
1000100 1000100 0101111 0100100 
1010000 1010000 0111100 0110000 
ROW' COL ROW" COL" 


where the boldface entries are those involved in the 1D switchings. 

Now Step B.C checks the boundaries of Es and ET for the presence of 
strongly linked elements present in p? and p“). In this case, only the positions 
(3, 3), (4, 3), and (5,3) allow one, so Steps B.A and B.B are performed again 
after updating 7' = 4 and j’ = 4. 
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The process goes on without finding inconsistences until all the compo- 
nents of the elementary switching pair are detected. The four matrices com- 
puted at the end of Step 2 are shown in Fig. 15.7, where the elements of EQ 
and E(U are indicated by 3 x 3 boxes. 






























































:01/01/0|11 0:11 0|0 0 0 
i1 1100/00 11100 1/01 1 
01/1/0010 1:0/0]0]0|1 0 
1 0/0/0|1/0:0 01/0/1!1/11 
1010000 0111011 

ROW = COL ROW "= COL" 


Fig. 15.7. The matrices ROW’ = COL’ and ROW" = COL”, and the composite 
switching patterns E and EO), 


Matrices ROW, COL, and A are updated, and HUNT-SWITCH (3, 1) termi- 
nates. Step 2 of REC2DSWITCH finishes by scanning ROW and COL without 
making any further change, and matrix A, which is equal to ROW, is given 
as output. 


15.4 Reconstruction of Factor Structures Using the DT 
Method 


In the last section of this chapter, we consider the following problem. Let 
us suppose that there is a 3D dynamic object, which can be represented by 
a nonnegative function f(r,t), where r and t denote the position in space 
and the time, respectively. Suppose that f can be expressed as a weighted 
composition of a number of (so far unknown) binary-valued functions f;(r), 
k — 1,2,..., K (K > 1), being constant in time, such that 


f(r,t) = e(t) - fair) + e(t) - falr) +: t ex(t) f(r) + n(,t), (15.80) 


where c(t) denotes the kth weighting coefficient, which depends on time, and 
n(r,t) represents the noise or residual in (r,t). Given the assumption that 7 
and f are uncorrelated, c(t) and f(r) are to be determined so that the f; 
are independent of the fj, for all k Z j. If the values of f(r,t) are available, 
then the problem can be solved by factor analysis. 

However, in reality, we cannot always measure the function f in the points 
of the space, but we can measure certain projections only. This is frequently 
the case, for example, in nuclear medicine, where the object is the radioactivity 
distribution in an organ and the projections are the gamma camera images 
from different directions. In such a case, SPECT imaging is applied to collect 
projection data for reconstructing tomographic slices of the object. 
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Furthermore, we deal with a special emission tomography model, where 
the object is considered as a set of points emitting rays into all directions of 
the space, and the space is filled with some material attenuating the rays. It is 
important that the absorption is included into the definition of a projection. 

Let f(r,t) denote the intensity function of the object to be reconstructed. 
Suppose that the absorption in the space is constant; u > 0 everywhere. All 
half-lines in the space can be described as (S, v) = (S-Fu:v| u > 0}, where 
S and v are the point and the direction of the half-line, respectively. Then 
the absorbed projections of f in time t can be measured along half-lines by 
so-called point detectors: 


[PH f\(S,v,t) = fre +u- v, t) e™" du. (15.81) 
0 


For the sake of simplicity, let us suppose that the support of f is the 
3D unit cube, and the detectors sit on the left, right, up, and down sides of 
the cube, and measure the absorbed projections along half-lines, which are 
perpendicular to the corresponding side of the cube. 

The problem is to reconstruct f from a small number of absorbed projec- 
tions, especially if we have only four projections, £9 f, R9 f, U™ f, and 
D) f, defined as 








1 
EW fut) = f fuus e du, 
"i 
R” f(y, t) = nuc —u,y,t)-e "" du, 
j 
UY) fi(a,t) = | f(z,1—u,t)-e-"* du, 
^ 
D? f(a, t) = f 190m du. (15.82) 
0 


That is, we are given two opposite horizontal and two opposite vertical ab- 
sorbed projections (see Fig. 15.8). Let us call the first two projections left 
and right ones, and the other two projections wp and down ones. (In the 
emission tomography model, opposite projections are not identical because of 
attenuation. ) 

The task can be divided into two parts: First we have to separate the 
projections of the individual structures f; from the linear combination of the 
projections of the 3D dynamic object f [see (15.80)]. Next we reconstruct the 
cross sections of each 3D factor structure independently. 

In order to test the possible application of EDT in such cases, a phantom 
model was selected and the reconstruction performed. 
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Fig. 15.8. The arrangement of the opposite absorbed projections. 


15.4.1 Phantom 


We implemented and tested a two-step procedure for reconstructing phys- 
iologically separable factors of a simulated renal system (the simplified 3D 
mathematical model of the kidney was provided by Dr. W. Backfrieder, AKH 
Vienna, Austria [1]). The phantom satisfies the assumption of factor analysis 
that the structures are homogeneous, i.e., each structure is a composition of 
identical voxels. The system consists of five factors representing two vascular 
and two renal structures and the urinary bladder (i.e., K — 5). They are in 
a discrete space of 64? voxels (voxel size is 6 x 6 x 6 mm). Each structure 
is defined by a geometric object (e.g., by discrete spheres, tubes). Table 15.1 
shows some information about these structures. 


'Table 15.1. Structures of the Given Phantom 


Structure name Organ Volume 

Vascular structure 1 heart, aorta 2652 voxels 
Vascular structure 2 liver, spleen 10603 voxels 
Renal parenchymas two identical items 1350 voxels 
Renal pelvis two identical items 606 voxels 
Urinary bladder bladder 2094 voxels 


Background rest of the 64? cube 
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The background represents the rest of the cube. Absorption, scatter, 
depth-dependent resolution, partial volume effects, and also Poisson noise 
have been taken into account in this simulation in order to approximate a 
real nuclear medicine SPECT study. The factors change their weights (inten- 
sities) with time according to some given functions c(t), k = 1,2,...,5. Four 
absorbed projections with size 64 x 64 are taken at 120 discrete-time moments. 
The simulated structures have specific dynamics (radioactivity changes with 
time) so that they can be separated from each other by factor analysis. 


15.4.2 Factor Analysis 


Factor analysis (FA) was performed for each sequence of projections (left, 
right, up, and down) by the method using spatial constraints, e.g., nonneg- 
ativity of the factors [24, 25]. For each direction, the results of the FA are 
five 64 x 64 factor images and the corresponding five vectors with 120 factor 
coefficients. However, the output factor images are not necessarily the projec- 
tions of the factors. What we obtained as factor images are the projections 
of the factors up to some multiplicative constants only. That is, we have five 
images uo» ge (see Fig. 15.9) and five vectors c9, k — 1,2,...,5, at the 
end of the FA step applied to the projection up. The relation between fk, Ck 
and je, c) is c(t) - fr(r) = ck) (t) : (r), for all k = 1,2,...,5. Similar 
relations can be set up for the results of the FA applied to the images taken 
in other directions. The curves containing the resulting coefficients are given 
in Fig. 15.10. 


15.4.3 Reconstruction 


Although se and the factors computed by FA are not necessarily binary func- 
tions, they are still two-valued [the range of function f(u), contains the 0 and 


another real number, say I Tuy In order to reduce the problem to binary recon- 


struction problems, we have to find the values T w, This pre-reconstruction 


step was done by a heuristic algorithm described in [21]. 

Knowing the four absorbed projections of each factor, we can try to re- 
construct them by some EDT reconstruction method. Let us suppose that the 
2D cross sections of the factor functions can be represented by n x n binary 
matrices, or—equivalently—by binary vectors x € (0,117, J = n?. 

Our EDT reconstruction problem can be described as a linear equation 
system 

Ax = b, such that x is binary. (15.83) 


Instead of solving the equation system (15.83) directly, it is reformulated as an 
optimization problem. Formally, find the minimum of the objective function 


C(x) = || Az — b||? + y - (a), such that x is binary. (15.84) 
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(c) (d) 





(e) 


Fig. 15.9. The up-projections of the factor structures. (a) Heart and aorta. (b) 
Liver and spleen. (c) Renal parenchymas. (d) Renal pelvis. (e) Bladder. 
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Fig. 15.10. Curves computed by FA and showing the changes of factor coefficients 
in time in all four projections. (a) Heart and aorta. (b) Liver and spleen. (c) Renal 
parenchymas. (d) Renal pelvis. (e) Bladder. 


The first term in (15.84) controls that we have an x satisfying (15.83) 
at least approximately. The second term gives us the possibility to include a 
priori knowledge about x into the optimization. In our case, we have used the 
function 


J J 
(x) = X >) Gold(j, 0) - |z; — zel, (15.85) 
j=0 £=0 
where G, is the Gaussian function with parameter c and d(j, £) denotes the 
distance between the jth and /th pixels. This regularization term forces the 
optimization procedure to find images having possibly large connected regions 
that have the same binary values. 

For solving (15.84), the simulated annealing (SA) optimization method 
[20] was implemented. 
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15.4.4 Results 


The factor analysis successfully separates the projections of individual dy- 
namic objects; see Fig. 15.9 demonstrating the separated up-projections and 
Fig. 15.10 showing the factor weights (coefficients) changing in time in all four 
projections. It is interesting that the structures in the center of the 3D cube 
space [heart and aorta, Fig. 15.10(a) and bladder Fig. 15.10(e)] have almost 
the same curves in all projections (absorption has the same effect in all four 
directions). The effects of absorption in the case of other structures located 
off-center of the 3D cube are different in different directions. 

For the reconstruction, we had to choose a proper regularization scalar ^ 
in (15.84), then we could perform the reconstructions slice by slice for each 
structure (see Fig. 15.11). After reconstruction, we could calculate the volumes 
of each structure (see Table 15.2). The percentages in Table 15.2 give the 
values relative to the real volumes (see Table 15.1). 


Table 15.2. The Volumes of the Reconstructed Structures 


Structure name Original volume Reconstructed volume 


Vascular structure 1 2652 voxels 2546 voxels (96%) 
Vascular structure 2 10603 voxels 9629 voxels (90%) 
Renal parenchymas 1350 voxels 1450 voxels (107%) 
Renal pelvis 606 voxels 513 voxels (85%) 
Urinary bladder 2094 voxels 1932 voxels (92%) 


Having reconstructed all sections of all structures, we could make a 3D 
visualization by the program package Slicer [26]. The result can be seen in 
Fig. 15.12. 

During reconstruction we observed that the liver and spleen (vascular 
structure 2) were considerably less smooth than other structures. This error 
can have two likely reasons: 


(a) Factor analysis error — due to 

(i) complete overlaps of the liver and spleen in the left and right projec- 
tions, and 

(ii) high attenuation of both structures in all four projections. One of the 
two structures is lost in each of the two lateral (£ and R) projections. 
Such data represent a difficult task for factor analysis. 

(b) Reconstruction error — factor images of the liver and spleen are of low 
contrast and noisy in all projections. This is a likely reason for “blurriness” 
or poor smoothness of those structures in our reconstructions. In fact, it is 
not a reconstruction error as such — the result is a realistic reconstruction 
of noisy factor objects. 


Looking at Table 15.2, we can conclude that the volume of the factor struc- 
tures was near the expected values (with an error less then 10%), the worst 
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Fig. 15.11. Representative reconstructed slices of the structures. (a) Heart and 
aorta. (b) Liver and spleen. (c) Renal parenchymas. (d) Renal pelvis. (e) Bladder. 
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(a) (b) 


Fig. 15.12. 3D visualization of the reconstructed structures from two views. 


value is in the relatively small renal pelvis, which could be recostructed with 
an error of 15%. It is important to remark here that during the reconstruction 
we considered the absorption without any correction of scatter and noise. 

In essence, our method provides a solution to a general problem of recon- 
structing 3D dynamic binary structures from a small number of projections. 
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Application of a Discrete Tomography 
Approach to Computerized Tomography 


Y. Gerard and F. Feschet 


Summary. Linear programming is used in discrete tomography to solve the relaxed 
problem of reconstructing a function f with values in the interval [0, 1]. The linear 
program minimizes the uniform norm ||Az — 6||.. or the 1-norm || Ax — 6||1 of the 
error on the projections. We can add to this objective function a linear penalty 
function p(x) for trying to obtain smooth solutions. 

The same approach can be used in computerized tomography. The question is if 
it can provide better images than the classical methods of computerized tomography. 
The aim of this chapter is to provide a tentative answer. We present a preliminary 
study on real acquisitions from a phantom and provide reconstructions from a few 
projections, with qualities similar to the traditional methods. 


16.1 Introduction 


Computerized tomography (CT) is a technology of major importance in med- 
ical imaging and in several other fields related to nondestructive material 
testing. Although there has been considerable progress since the 1970s, im- 
proving the methods remains an important aim. The two important features 
of a reconstruction algorithm are its time of computation and the quality 
of the reconstructed images. The time of computation has been a decisive 
consideration in CT, but the improvement of CPUs now allows us to investi- 
gate methods that used to be considered too time-consuming. The hope is to 
enhance significantly the quality of the reconstructed images. 

The notion of quality of an image is essential. What is a satisfying image 
and what is a nonsatisfying one? The answer to this question determines the 
operational details of many CT algorithms. If the question is put to a physi- 
cian, he will answer in terms of diagnosis: The best image is the one that 
allows the best diagnosis. Unfortunately, medical knowledge can be hardly 
translated into numerical data. The usual way to express the quality of an 
image in regard to the data is to evaluate the physical adequacy between the 
image and the measurements. This value is called the likelihood of the image. 
Its mathematical expression involves a model of the physical process of the 
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measurement. According to this point of view, the problem of tomographic 
reconstruction becomes the problem of computation of the most likely image 
given the data (if we add some constraints of speed, the problem is to com- 
pute in a bounded time the most likely image as possible). This is a problem 
of optimization. A description of a reconstruction algorithm requires in this 
framework three specifications: 


(a) What is the chosen mathematical model of the physical process of the 
measurement? 

(b) What is the mathematical expression of the likelihood of an image given 
the data? 

(c) What is the method of optimization used for obtaining an image of max- 
imal likelihood? 


There are as many tomographic methods as replies, and we are going to 
present a method by answering, briefly in this introduction and with more 
details in the following, these three questions as follows: 


(a) Regarding the mathematical model of the physical process of the mea- 
surements, it is possible to take into account the probabilistic laws that 
govern emission or absorption of particles [20], but we use a more basic 
model: a Dirac model, where the measurements are assumed to be the 
sums of the values of points belonging to digital lines. 

(b) Regarding the second question, we express the likelihood of an image using 
the norm of the error vector, where the error vector is the difference be- 
tween the real measurements (the data) and the measurements simulated 
from the image (with sums according to discrete lines). 

(c) The chosen method of optimization is linear programming (LP). This 
approach, although novel in the framework of CT, comes from DT, where 
LP has been used since the 1990s [1, 4]. LP requires a linear form for the 
likelihood. In spite of this constraint, Weber et al. [22] have shown that a 
linear regularization term can be added to the objective function in order 
to obtain smooth reconstructed images. 


The fact that LP has been first used in DT can perhaps be explained by 
the time of computation and the size of the reconstruction instances, which 
are smaller in DT than in CT, since binary constraints make the problem 
harder. The converse is, however, more usual: Many ideas of DT have been 
first investigated in CT [16, 21]. This is, for instance, the case with Bayesian 
methods. A methodology for obtaining a DT algorithm from a CT method is 
discussed in [2]. The idea is here to do the converse: Obtain a CT algorithm 
from an approach that has been first developed for DT. 

The use of LP for DT goes back to the early days of the field. The relaxation 
of the binary constraint x € (0,117 to æ € [0,1]? has been introduced in DT 
by R. Aharoni, G.T. Herman, and A. Kuba in [1], who showed that with this 
relaxed constraint the problem of reconstruction could be solved by LP. Their 
idea was to use this routine for obtaining binary images, and the technique 


16 Unique Reconstruction of Bounded Sets 369 


has been followed up by several authors [4, 8, 22, 23]. By using the same kind 
of method in CT, we are only returning the approach to its domain of origin 
with the hope that it can be helpful for providing nice reconstructed images. 

We focus in Section 16.2 on a short presentation of CT classical algorithms. 
Sections 16.3 and 16.4 are devoted to the presentation of the new method 
by answering in detail the three previous questions. Section 16.5 considers 
the various methods of optimization to solve our LP problem. Section 16.6 
presents the experimental results on a phantom. 


16.2 CT Reconstruction Methods 


The first kind of CT methods are based on Fourier analysis. They are said 
to be analytical. The physical process of the measurements is modeled by 
a continuous Radon transform [17]. No notion of likelihood is used in this 
framework, and no optimization step is required since Fourier slice theorem 
provides an exact formula for inverting the Radon transform. Thus, the main 
theoretical difficulty of this kind of method is the conversion of the discrete 
data (there are only a finite number of detectors in a camera) into a contin- 
uous function. After this first step, the application of the inversion formula 
of the Radon transform provides a continuous function that can be digitized 
in order to obtain the reconstructed image. The fact that no optimization is 
needed reduces the time of computation. This kind of method is fast, but the 
conversion from the discrete data into a continuous function requires relatively 
dense sampling. It compels the number of directions to be greater than 30 and 
a low ratio between the noise and the signal amplitude. We can say that X-ray 
scanners satisfy these experimental conditions, but not emission tomography. 
This explains why the main analytical reconstruction method called filtered 
back-projection (FBP) is used in X-ray scanners, but also that its application 
in nuclear medicine [single photon emission computed tomography (SPECT) 
or positron emission tomography (PET)] is more problematic. 

'The analytical approach is fortunately not the only option. Other meth- 
ods have been developed. They are said to be iterative, since their principle is 
to increase iteratively the likelihood of a current image given the data. Sev- 
eral models can be considered, but due to time of optimization, the rivalry 
between analytical and iterative methods has first turned to the advantage 
of analytical approach. The development of emission tomography in medical 
imaging, the enhancement of mathematical knowledge of optimization, and 
the improvement of CPUs have reverted this tendency and provided a large 
reemergence of interest in the iterative approach [13]. 

The oldest iterative method is the algebraic reconstruction technique 
(ART) [6, 7, 10, 11]. The physical model considered in this approach is dis- 
crete. There is the choice between partitioning the environment in a lattice of 
cells (pixels, voxels) or representing it by a lattice of points (a Dirac model). 
'The problem of reconstruction becomes that of attributing to each cell or each 
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point a value that can be considered to be its gray level. The mathematical 
model of the measurements assumes that each measurement corresponds to 
a linear combination of these values (the attribution of the coefficients of the 
linear combination is discussed in the next section). Thus, the problem of re- 
construction is reduced to solving a large system of linear equations Ax = b 
(one variable per pixel of the resulting image). The large size of the linear sys- 
tem is not the only reason to avoid the classical Gauss method: The feasibility 
of the linear system is not guaranteed, but it is unthinkable that due to our 
reconstruction algorithm we have to say to a physician, “sorry, the data do 
not provide any solution.” Thus, it is more natural to use iterative algorithms 
to solve the linear system Az = b, since they always provide an approximate 
solution. There exist several well-known methods in numerical analysis: Ja- 
cobi, Gauss-Seidel, or their relaxed versions. The principle of ART is quite 
similar to the one of the Jacobi approach, with the difference that it takes 
advantage of the properties of the tomographic linear systems to accelerate 
convergence. In this framework, we consider that the likelihood is monotonic 
in the difference between Az and b. This leads to a class of iterative algorithms 
for || Ax — b|| minimization, a classical approach used in numerical analysis for 
solving linear systems. Several gradient methods can be used in this frame- 
work. All these algebraic methods are based on approximating a solution of 
the linear system of equations by using iterations on a current image (or a 
current vector). 

Another approach is expectation maximization (EM) and its faster variant- 
ordered subset expectation maximization (OSEM). The mathematical model 
of the measurements uses Poisson statistics, namely “exact” probabilistic laws 
of emission and absorption of a particle [20]. This results in a nonlinear math- 
ematical expression for the likelihood of an image and a process to improve 
iteratively its value for a current image. 

All the iterative methods, from ART to OSEM, have been improved by 
adding the possibility of taking into account information in addition to the 
measurements. This can be an arbitrary choice (for instance, that an image 
should be quite regular) or knowledge coming from other equipment. This 
interaction with external data is modeled mathematically by Bayes’ theorem. 
'The likelihood becomes the product between the consistency of the image with 
prior information and its consistency with the measurements. It is also possible 
to compute their sum instead of their product (with or without adding a log), 
so that the total likelihood is the sum of the consistency with the measure- 
ments and a second term penalizing inconsistency with the prior information. 
'This modification of the likelihood has, of course, important consequences on 
the iteration process; the resulting iterative methods are called “Bayesian.” 
'The algebraic method that we present in this chapter is from this class. 
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16.3 Dirac Model with Digital Lines 


Any mathematical model of the measurement is based on a sequence of more 
or less arbitrary choices. The first important choice is between models with 
a lattice of pixels and those with a lattice of points. Although the material 
investigated by CT is usually continuous, the Dirac model (with points) can 
provide good results. 


16.3.1 Pixel Model 


In the case of a pixel model, it is often considered that a measurement by a 
detector of the camera is a linear combination of the gray levels of the pixels 
crossed by the corresponding straight line. The contribution of a pixel to the 
measurement is weighted by the length of its intersection with the line. This 
is the principle of the length ray model (Fig. 16.1). 





detector 











Fig. 16.1. In the length ray model, the measurement of a detector is the linear 
combination of the gray levels of the pixels crossed by the corresponding line. The 
contribution of each pixel is weighted by the length of its intersection with the line. 


16.3.2 Notation 


In the following, we consider a lattice of points that is embedded in a real 
Euclidean space. The lattice consists of all points that are integer linear com- 
binations of two vectors 7 and j. The Euclidean scalar product of vectors spec- 
ified by (u,v) and (a, y) (i.e., ui + vj and xi + yj) is denoted by (u,v).(z, y). 
The Euclidean norm associated with this dot product is denoted by ||(x, y)]|. 
We notice that, generally speaking, (u, v).(x, y) is different from ux + vy and 
\|(x, y)|| is different from y xz? + y?, since we do not assume that į and j are 
vectors of norm 1 perpendicular to each another. If they are perpendicular 
and their lengths are both equal to l, then the dot product can be expressed 


directly by (u, v).(z, y) = l2 (ux--vy), and it follows that ||(z, y)|| = Iz? + y?. 
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16.3.3 Basic Dirac Model with Digital Lines 


We consider in our Dirac model that the measurement of a detector is the sum 
of the gray levels corresponding to lattice points belonging to a strip (Fig. 
16.2). Such a structure is called digital line: A digital line consists of those 
lattice points (x, y) that are solutions of an equation of form h < (u, v).(z,y) < 
h+6 [18], where the vector of coordinates (u, v) [assumed different from (0, 0)] 
is normal to the direction of the line. The value vih] is the shift of the digital 
line with respect to the origin, and the value ó/||(u, v)|| is the thickness of the 
digital line [|h] and 6/||(u, v)|| are lengths]. 


7 detector 





Fig. 16.2. On the left, the measurement of a detector is the sum of the gray levels 
of the pixels belonging to a digital line, namely the sum of the values of the points 
belonging to a strip (the weight of each point is 1 if it belongs to the corresponding 
digital line and 0 otherwise). On the right, the tiling of the lattice points by parallel 
digital lines. This model guarantees a uniform contribution of all points to the 
measurements in any direction. 


It seems natural in this framework to choose digital lines, where the thick- 
ness 0/||(u, v)|| is equal to the width of any detector (Fig. 16.2). By assuming 
that the detectors are placed contiguously (without spaces between them), 
this provides in each direction a set of digital lines tiling the lattice. This 
model is our basic Dirac model with digital lines. 


16.3.4 Advantages and Drawbacks 


'The first advantage of the previous model is that it provides linear combina- 
tions with only Os and 1s as coefficients. As each point in the image belongs 
to exactly one digital line of the tiling, the contributions of all the points are 
equal in all directions (Fig. 16.3). To this advantage, there is a drawback. Let 
us assume, for instance, that all the points belonging to a region of interest 
(ROI) have a gray level equal to 1, while all the others have a null gray level. 
Thus, the value measured on a detector is equal to the number of points in 
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the intersection between one digital line and the ROI. Due to the definition 
of digital lines, the simulated measurements have unrealistic structural vari- 
ations (Fig. 16.3). This drawback comes from the variation in the number of 
lattice points in consecutive digital lines. The first part of this variation is 
due to the length of the segment cutting the lattice (an equivalent variation 
is also observed in the length ray model), but there is also a second term: A 
given digital line has more or less thin sections. By considering the intersec- 
tion between parallel digital lines of constant thickness and the rectangular 
support of the image, some digital lines have a thin section in the rectangle 
while others can have a thicker one. It provides an unrealistic variance in 
the simulated measurements. This detrimental effect can be important in the 
two diagonal directions (the ratio between the cardinalities of parallel digital 
segments of same length can be equal to 2) and even in the horizontal and 
vertical directions of the lattice. 


N 
) 


O 


b 


4 


Q 


IN 
N 





YÀ 
3 v4 7 points 
B 4 points 
7 points 


Fig. 16.3. A region of interest is represented in gray. If we assume that the points 
in this region have a gray level equal to 1 while all other gray levels are null, this 
provides us with a sequence 7, 4, 7 in the simulated measurements, which is not 
realistic. 


16.3.5 Improving the Model 


We could hope that previous detrimental effect of cardinalities variance re- 
mains quite small in practice, but we prefer investigating different ways of 
improving the model by introducing some corrections: 


(a) Choose a starting angle for the reconstruction that restricts cardinalities 
variance. This becomes difficult if the number of directions of the mea- 
surements becomes large. 

(b) Use digital lines that do not have a fixed thickness 6/||(u, v)||. This means 
that the thickness of the digital lines will not always be equal to the 
width of the detectors. One consequence is that the set of digital lines 
considered in one direction does not tile the lattice anymore. Naive digital 
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lines (6 = max{|(u, v).(1,0)|, |(u, v).(0, 1)|}) or standard digital lines (6 = 
\(w, v).(1, 0)| + |(u, v).(0, 1)|) [18] can be used. We call this model a Dirac 
model with naive or standard digital lines. 

(c) Weigh the contribution of the points by the total number of points in the 
digital line. We call this reasonable option the cardinal correction in the 
experiments. 


The two last corrections destroy the property that all points have the 
same contribution to all measurements. They favor another property that is 
verified by the length ray model: The sum of the coefficients for computing 
each simulated measurement according to all detectors is proportional to the 
length of the intersection between the corresponding line and the lattice. We 
can ask for a perfect model to conciliate perfectly both requests (such a model 
would be perfect, but it seems hard to obtain). 

The three models tested in the experimental section are the (basic) Dirac 
model with digital lines, the Dirac model with naive digital lines (which do not 
tile the lattice), and the Dirac model with digital lines corrected by weights 
associated with cardinalities (cardinal correction). 


16.4 Likelihood 


Regardless of the choice of the algebraic model (length ray model or Dirac 
model with digital lines), it provides a linear operator A that allows us to 
express the simulated measurements from a gray-level image x as the linear 
product Az. 

Denoting the vector of the experimental measurements by b, the problem 
of reconstruction is to compute a gray-level image x satisfying Ax = b. There 
may not be an exact solution, and it is necessary to compute an approximative 
one. This is the idea that we have introduced with the term “likelihood”. Any 
image x may be more or less likely with respect to the measurements b. The 
ideal image satisfies Ar — b, while an unlikely image is characterized by an 
Ax far from b. Thus, it is natural to consider that the likelihood of z can be 
evaluated from ||.Ax — b|| with a norm chosen between ||.||oo, ||.]|1 and |].|[2. 

This approach is efficient, but better results can be obtained by adding a 
Bayesian correction: We add to the norm of the error Ax — b a second term 
based on a smoothness prior. 

'The idea to consider a smoothness prior in conjunction with the norm 
of the residual error was already present in Chapter 13 of [9], while the in- 
troduction of a regularization term goes back to 1976 [12]. This approach 
with the Euclidean norm has been, for instance, investigated in CT in 1992 
by L. Kaufman [14, 15] (|| Ax — b|| minimization is the classical least-squares 
method), while an approach with ||.||; or ||.||aa norms has been investigated 
by S. Weber et al. in the framework of DT in [23]. Our contribution to this 
technique starts in [8, 5]. 
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The principle is to express the likelihood as the sum of ||Ax — 6|| and a 
penalty term p(x) penalizing an irregular x. One choice is to express p(x) as 
the sum of quadratic terms )>,<;(%i — xi)”, where I is the set of the edges 
between neighboring points or pixels in the lattice, and where 7’ and i” denote 
the vertices of edge i. Thus, p(x) is null if, and only if, the gray level of x is 
constant. This term provides a measure of the local homogeneity (the smooth- 
ness) of the image x, but, its expression being quadratic, it cannot be used in 
our linear approach. À method to obtain a linear equivalent of this value of ho- 
mogeneity is presented in [22]. It consists of introducing one auxiliary variable 
(denoted y;) for each edge i between neighboring pixels or points ?' and i". By 
imposing the two constraints (1) y; > xy — aj and (2) yi > zi» — zy, we can 
be sure that y; > |x — zj^|. We consider that the value |x; — zj"| expresses 
the local homogeneity of x around the edge i. Thus, the sum »5;c; |; — zi" | 
expresses the overall smoothness of x. The sum of auxiliary variables 5 /;- ; yi 
is necessarily greater than the smoothness value 5 /;-; |r; — xi” |, but, as there 
is no other constraint on y than (1) and (2), $ ez yi minimization guaran- 
tees equality X-er Yi = J; |vv — xi |. This provides a way to incorporate the 
smoothness of x in an objective function without using any absolute value or 
quadratic expression but with auxiliary variables. Some other penalty terms 
can also be expressed linearly with the trick of using an auxiliary variable, but 
in order to reduce the number of parameters of the method, we have focused 
our experiments on the edge penalty. 

Now that we have expressed the penalty p(r) by a linear term that can 
be incorporated in an objective function, it remains to do the same with the 
norm || Az — b||. This is, of course, not possible in the case of Euclidean norm, 
but for ||.||; and ||.||;;, we use the same kind of technique. 

We start with the ||.||1 norm by introducing a vector of auxiliary variables 
h related to x by the linear constraints (3) —h € Ax — b < h. This guarantees 
the inequality ||Ax — b||; € 1.h, where 1.h is the dot product between h 
and the vector 1 of constant coordinate equal to 1. If no other constraint 
on h is introduced, 1.5 minimization guarantees equality ||Ax — b||; = 1.h. 
Thus, we consider the likelihood of x as the sum likelihood(x) = ||Ax — 
blli +K uere — z|, where K is a positive constant controlling the weight 
of regularization (also referred to as the "entropy" ahead) with respect to 
the error on the measurements. This nonlinear value can be minimized on 
x by minimizing the linear objective function 1.h + K 5 5;-, y; under linear 
constraints (1), (2), and (3). 

With the ||.||.. norm, we use the same principle, but instead of bounding 
Ax — b with a vector h, we use a real number h. The linear constraints (4) —^1 
< Ax —b < hl imply the inequality || Az — ||. < h. If we introduce no other 
constraint on h, the minimization of h leads to the equality || Ax — b||;; = h. 
Thus, we express the uniform likelihood of x as likelihood,.(x) = ||Ax — 
bll + K er |vzs — vi^], where K is again a positive constant controlling 
the relative weight of smoothness penalty. This nonlinear expression can be 
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minimized on x by minimizing the linear objective function h + K c, yi 
under linear constraints (1), (2), and (4). 


16.5 Optimization 


As we have reduced the problem of minimization of the two Bayesian expres- 
sions of likelihood likelihood;(x) and likelihood,,() (a low value corresponds 
to a likely image) to a problem of minimization of a linear objective function 
under linear constraints (a linear program), it remains only to solve it by us- 
ing an LP algorithm. Several methods can be used: primal or dual or even 
primal-dual simplex algorithms [19] or interior points methods [25]. Special 
techniques could also be used, but we believe that standard methods in LP 
are efficient enough to solve our problem. We tested the solution of our LP 
problem using both soplex [24], which is freely available, and the solver Cplex 
9.1, which is probably the best commercial solver available today. All tests 
were done on a Pentium IV Xeon machine with hyperthreading and 1 GByte 
of main memory. soplex furnishes an implementation of the primal simplex 
method, whereas Cplex offers the primal and dual simplex methods and an 
implementation of an interior points method. All optimization methods are 
iterative. It should be noted that simplex-based methods are exact, whereas 
interior points methods are approximate in the sense that they find only ap- 
proximations to the optimal solutions. However, interior points methods could 
be pursued until a guaranty of optimality is reached. As this feature is imple- 
mented in Cplex, we choose to always optimize until optimality is reached. 
This means that our running time will be higher for interior points methods 
than what is really needed to provide a good solution. We have estimated 
that times could be divided by two if we were interested in an approximate 
solution that does not differ much from the optimal one. 

Since we provide the possibility to solve the tomographic reconstruction 
problem with either the ||.||,. or the ||.||1 norm, we tested both reconstructions 
with the different optimization method. It should be noted that with the ||.||oo 
norm, we sometimes obtain problems that are not solved with the interior 
points algorithm (it terminates with an infeasibility error), while the dual 
simplex algorithm always works. Such problems with interior points methods 
never happen with the ||.||; norm, and we believe that they are related to the 
use of the ||.||;3; norm. 

'To appreciate the times of the different optimization methods, we mention 
that a typical LP problem has 15,000 constraints and 11,000 variables for a 
reconstruction with 16 projections on a 64 x 64 matrix. The LP system is 
always sparse for the ||.||; norm but contains at least one dense column for 
the |].||.. norm, which means that the latter optimization problem is harder 
to attack and is less efficient. In a previous paper [5], experiments done with 
soplex took 6 hours, which was prohibitive for clinical use. When switching 
to Cplex and the dual simplex method, our running times reduced to less than 
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10 minutes. This is already progress. When using the ||.||; norm, with which 
we always reach an optimality condition with interior points methods, our 
running times reduced to 25 seconds. In the last experiments presented in the 
next section, no computation took more than 25 seconds to be completed with 
a certificate of optimality of the solution. It is, of course, much more time than 
classical methods. The slowness of this approach is the consequence of using 
a general linear programming method, which is not optimized for the kind 
of instances that we have. It remains, of course, possible to stop the process 
of optimization before its end, and we can also imagine optimizing the linear 
programming algorithm for working with the instances of our problem, but 
these two perspectives have not been yet investigated. 


16.6 Experiments 


The experiments are based on a phantom acquired by the Center of Nuclear 
Medicine of Clermont-Ferrand. The projections were acquired by single photon 
electron computer tomography (SPECT) on a lattice of 64 x 64 detectors with 
a step angle of 12° along an arc of 180° (Fig. 16.4). 





Fig. 16.4. The 16 measurements of the Phantom with a step angle of 12°. Thus, 
the first and the last images are projections in opposite directions. 


The phantom is composed of a surrounding envelope in plexiglas that does 
not interfere too much with the measurements. Inside the plexiglas structure, 
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six balls containing radioactive liquids are placed in the same horizontal plane. 
The balls have varying radii and concentrations. The smallest ball has a size 
close to the resolution of the measurement device. Since we only use 16 pro- 
jections instead of the usual 64 projections, it is very improbable that the 
smallest could be visible in the reconstructions. 





Fig. 16.5. Reconstruction using the ||.||; norm. 


The first reconstruction was done with the ||.||;5 norm (Fig. 16.5). We have 
done many tests with varying K to see if a better image could be obtained but 
without success. We obtain a poor-quality image because of the noise present 
in the projections. If we suppress completely the entropy by setting K = 0, 
we obtain a worse image with a complete disorder. Due to all these reasons, 
we decided to focus only on the ||.|; norm with edge entropy and a factor 
K — 0.4. It is clear that in some experiments better images could be obtained 
based on a careful analysis of the K-value but, for comparison purposes, we 
chose to impose the value of K such that direct comparisons are possible. 


Fig. 16.6. Reconstruction with the naive digital lines model, the standard digital 
lines model, and the digital lines model (6 = Va? + 62). The third model is the 
so-called (basic) Dirac model with digital lines. 


We tested reconstruction when using the basic Dirac model with digital 
lines (their thickness 6/||(u, v)|| is equal to the width of the detectors so that 
the set of digital lines in each direction tiles the lattice), and we tried the 
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two corrections suggested to improve the model by using the naive and the 
standard digital lines. The corresponding results are given in Fig. 16.6. The 
basic model provides the best reconstructed image with better localizations 
of the objects. When using standard lines, the objects are less localized and 
several artifacts appear in the reconstruction. The result with naive lines is 
not so bad, but since all directions are not of equal geometric thickness, some 
directional artifacts appear. It should be noted that the result with naive lines 
is less spread out than with standard lines. It is a consequence of the smallest 
geometrical thickness of naive lines. 





Fig. 16.7. Reconstruction: (top left) original, (top right) with cardinal correction 
only, (bottom left) with normalization correction only, (bottom right) with both 
corrections. 


Following these reconstructions, we focused on other corrections with the 
(basic) Dirac model with digital lines. The first correction is a normalization 
that modifies the measurements in order to give them the same mean value 
in all directions. The second correction is the cardinal correction, introduced 
for counterbalancing the variations of numbers of pixels in neighboring digital 
lines. Experiments with or without these two corrections (Fig. 16.7) show 
that they are both efficacious. The cardinal correction results in images that 
are less spread out. The normalization correction leads to images with more 
localized objects. Finally, we applied both corrections so that both effects are 
cumulated. Contrary to what we thought first, this does not lead to a better 
image. In fact, this last image does not have a good localization property and 
has more artifacts. Due to this cumulative effect, we have chosen to use only 
the normalization correction in next experiments. 
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Fig. 16.8. Sinograms for the slices numbered from 31 to 34 with normalization only 
(order is from top to bottom and left to right). 


A careful look at the reconstruction shows that merely three balls are 
correctly reconstructed. We always used slice 33 in the projections, which cor- 
responds to the plane passing through the center of the balls. To understand 
the intrinsic complexity of reconstructing more than three balls, we provide 
the sinograms for the slices 31 to 34 in Fig. 16.8. 

Clearly, three balls are visible. One of the other three can be seen in only 
two projections (see top of Fig. 16.8), and the two smallest ones are not visible 
at all. T'his explains why we cannot reconstruct more than three balls. Since 
all projections have equal importance, the fourth one is not present enough 
to be reconstructed by our method. It should be noted that this ball could be 
visible if the contrast in the sinogram was better (a contrast correction could 
be applied before reconstruction). 

As shown in the sinograms above, the location of the plane containing 
the center of the balls is not clear. Thus, we reconstructed all slices between 
the levels 31 and 34. The results are presented in Fig. 16.9. Level 32 is the 
level with the minimum number of artifacts. However, the contrast of the 
reconstructed balls is less than in level 33. This is why we have chosen to 
do all our experiments with level 33. Besides this, a careful look at the im- 
age of level 32 shows that the shadow of the fourth ball is perhaps present 
in the reconstruction. The position of the concentration of high values is 
consistent with the geometry of the phantom and the regular placement of 
the balls. This is obviously not sufficient for a firm conclusion, but we be- 
lieve it to be in favor of the quality of discrimination of our reconstruction 
method. 
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Fig. 16.9. Reconstruction for the slices numbered from 31 to 34 with normalization 
only (order is from top to bottom and left to right). 





Fig. 16.10. Butterworth post-filtering with varying f in {3,5,7,9} and n = 2 (order 
is top to bottom and left to right). 


In common clinical use, pre- and postfiltering are used to provide more 
readable images for the clinicians. One standard filtering is the Butterworth 
filter, given in the frequency domain by 
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1 
B(w) = — m’ 
be) 


where f is the cut-off frequency, which limits the influence of the filter, and n 
is the order of the filter. We provide two types of experiments by fixing of the 
parameters f or n and modifying the other one. We first present the impact 
of the cut-off frequency with a fixed order in Fig. 16.10. 


(16.1) 





Fig. 16.11. Butterworth postfiltering with varying n in {1,2,3,5} and f = 7 (order 
is top to bottom and left to right). 


The impact of f is clear. Smaller values lead to fewer frequencies preserved 
in the frequency domain. Thus, the images tend to be overblurred and details 
are simply removed. When f is increasing, we tend to the original image. Thus, 
f must be chosen not too low (to retain details), but not too high (to filter 
artifacts). The value of f thus depends on the relative frequencies between 
the details and the artifacts. 

In a second sequence of experiments, we fixed f = 7 and modified n. 
Intuitively, n governs the speed of the decay of the influence of the filter. This 
is confirmed in Fig. 16.11 by the fact that the balls become more blurry with 
high n, corresponding to the fact that fewer frequencies are kept by the filter. 

To investigate the impact of postfiltering, we searched for a compromise 
between f and m to provide the nice filtered image that is presented in Fig. 
16.12. 

We now compare our reconstruction with the two standard methods: FBP 
and OSEM. In Fig. 16.13, we provide the reconstructions made by FBP and 
OSEM from 16 projections. 
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Fig. 16.12. Butterworth filtering with f = 10 and n= 4. 





Fig. 16.13. (Left) FBP reconstruction, (right) OSEM reconstruction. 


The FBP reconstruction is usually performed with 64 projections. When 
the number of projections is reduced, the star artifacts become too noticable. 
Hence the reconstructed image is polluted by this effect and is not suitable 
for automatic inspection. However, the three main balls are clearly visible on 
the reconstruction. 

The image reconstructed by OSEM is of good overall quality. As we can 
see, the three main balls are clearly visible and the shadow of the fourth one is 
present. Moreover, very few artifacts are present in the reconstructed image. 

When comparing the OSEM reconstruction with our reconstruction, we 
can clearly see that the quality of our method is close to the one of OSEM in 
this experiment. There is still a little bit more artifact in our reconstruction, 
but the discrimination power toward the different balls is almost similar. 

To end this experimental part, we provide in Fig. 16.14 the reconstruction 
using only eight projections and no postfiltering. Note that this reconstruction 
was impossible with both the FBP and the OSEM methods, since the software 
of the machine does not accept less than 16 projections. 
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Fig. 16.14. Reconstruction using LP and only 8 projections. 


The quality of the reconstruction with only eight projections is not so 
bad. The three main balls are still clearly visible. Of course, having fewer 
projections provides more artifacts, but we hope that our approach can be 
robust toward the number of projections. 


16.7 Conclusion 


The three features of our method are 


(a) a Dirac model with digital lines, 

(b) an expression of the likelihood as sum of a smoothness prior and the norm 
|| Aa — b||1 of the error between the simulated measurements Ax and the 
real measurements b, 

(c) and a LP method of optimization. 


This approach comes directly from the algebraic model used in classical 
ART, with the difference that it guarantees the optimality of the likelihood 
of the result. 

A comparison of the experimental results with ART should be the next 
step of our investigations as well as the comparison with iterative least-squares 
algorithms. What are the relative benefits of the ||.||; minimization of the error 
and of the Dirac model with digital lines toward these classical approaches? 
Is there really a difference and does it justify such an increase in the time of 
computation? 

Whatever the real practical interest of our approach, it is not completely 
surprising that its principle comes from DT, because the small number of 
directions used in this framework requires us to make the most of the mea- 
surements. It is in such conditions that our method can find its application. If 
the measurements are very accurate, the use of such an optimal method is not 
necessary, but it becomes interesting only in the case of noisy projections in 
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only a few directions. Unfortunately, this kind of instance is less uncommon 
than we could hope, and it clearly appears that if a method admits a decrease 
in the quality of the measurements without impairing the reconstructed im- 
age, it would allow us to improve the productivity of some very expensive 
equipments. 
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equality position, 161 

equilibrium state, 306 

excess edges, 186 

existence, 3 


FBP (filtered back-projection), 369 
FCC (face-centered cubic) grid, 262 
feasibility, 207 
filling operation, 42 
finite n-length (Go-representation, 346 
flow, 178 

integral, 181 

maximal size, 179 
flow conservation constraint, 179 
FOM (figure of merit), 265 
FOM (figure-of-merit), 295 
FPI (flat panel image), 316 


Gale-Ryser theorem, 115 
generating polynomial, 59, 62, 76, 78 
geometric tomography, 3, 19 
Gibbs 
distribution, 250, 281 
parameter, 250 
priors, 248 
gluing, 157 
gluing sequence, 164 
grain, 271 
map, 272, 274 
reference, 292 
orientation, 271 
graph of a matrix, 96 
gray-value image, 249, 251 


Hammer’s X-ray problems, 19 
Hamming loss function, 253 
hyperpartition, 96, 105 


ICM (iterated conditional mode), 253, 
258 

image function, 306 

instability, 29 

integer programming, 228 

integral flow, 181 

intensity correction, 313 

interchange, 116, 118 


label image, 249 

labeling, 249 
algorithm, 274 

lattice, 274 
P-polygon, 22 
U-polygon, 20 
crystalline, 271, 274 
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cubic, 279 
direction, 32, 177, 194 
line, 22, 177 
orientation, 271, 274 
polygon, 22 
set, 4, 21, 31, 177 
simple cubic, 276 
least-squares, 227 
regularized, 228 
likelihood, 249, 367 
mean-by-the-mode, 254 
smoothness prior, 375 
limited view angle tomography, 316 
line 
edge, 178 
node, 178 
line-convex set, 33 
local 
configuration, 282 
features, 250 
method, 253 
potential, 250 
smoothness, 193 
logarithmic transform, 305 
Lorentz theorem, 337 
loss function, 249, 253 
LP (linear programming), 368 


majorization 
definition of, 85 
map 
grain, 272, 274 
reference, 292 
orientation, 274 
reference, 292 
MAP (maximum a posteriori probabil- 
ity) estimator, 248, 253 
marginal, 89 
posterior, 253 
material 
moderately deformed, 287 
texture, 279 
undeformed, 274 
matrix 
(0, 1)-, 88 
(0, 1)-additive, 89 
7-unique, 96, 105 
n-dimensional, 88 
additive, 100, 105 
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binary, 88 
minimal, 96 
real-minimal, 105 
uniqueness, 89 
max flow, 179 
algorithms, 189 
problem, 188 
MCM (Markov Chain Monte Carlo), 
253 
measurement, 249 
measurement vector, 251 
Metropolis 
algorithm, 256, 281, 290 
cycle, 282 
min cost flow, 187 
algorithms, 189 
implementation, 189 
problem, 182, 188 
minimal corner, 63 
moderate deformation, 274, 287 
MPM (marginal posterior mode) 
estimator, 248, 253 


NDT (non-destructive testing), 304 
neighborhood, 193 

radius, 193 
network flow, 179 

model, 186 

problem, 188 

time complexity, 188 
noise level, 259 
noisy projections, 184 
nonadjacency constraint, 127 
normal cone, 231 
normality assumption, 254 
NP-hard, 190 
numerical continuation, 230 


optimality 
global, 231, 238 
local, 231 
optimization 
concave, 229 
convex, 230, 234 
DC, 233 
global, 229 
interior point, 376 
linear programming, 376 
nonconvex, 232 


quadratic, 231 

simplex algorithm, 376 
orientation 

average, 276, 288 

distance, 279 

grain, 271 

lattice, 271, 274 

map, 274 

reference, 292 

spread, 288 

oversmoothing, 248, 261 


P-MAP (pseudo-MAP) estimator, 255 
P-MPM (pseudo-MPM) estimator, 255 
Pappus theorem, 28 
parallel 

algorithm, 209 

X-ray, 19 
partial volume effect, 247 
partition, 87 

conjugate, 87 
path 

northeast, 156 
permutohedron, 85, 106 
pixel, 249 

ambiguous, 288 

void, 299 
PL (pseudo-likelihood) 

approximation, 254, 255 

method, 250 
plane 

partition, 96 

set, 202 
point 

detector, 358 

edge, 178 

X-ray, 19 
Poisson noise, 208 
polycrystal, 271 
polyomino, 33, 156 
posterior probability, 249, 252 
potential, 281 
primary line edges, 186 
prior knowledge, 176, 181 
probability, 251 
problem 

consistency, 336, 339 

existence, 336, 339 


reconstruction, 336, 339, 347, 358, 
360 
weighted, 181 
uniqueness, 336, 339 
programming 
DC, 229, 232, 233, 239 
integer, 228 
projection, 3, 154, 177, 305 
absorbed, 358 
down, 358 
horizontal, 358 
left, 358 
opposite, 358 
right, 358 
up, 358 
vertical, 358 
first 
generalized, 339 
horizontal, 335, 346 
matrix, 252 
onto a convex set, 209 
second, 337 
generalized, 342 
sum, 177 
third, 337 
generalized, 342 
vertical, 335, 346 
projective transformation, 21 
protein, 247 
pseudo-posterior, 255 
pure quaternion, 277 
pyramid, 89 


Q-convex lattice set, 34 
quadrant, 34 
quadratic objective, 209 
quaternion, 276 
conjugate, 276 
identity element, 276 
norm, 276 
pure, 277 
unit, 277 


Radon transform, 89, 211, 305 
rational polygon, 22 
reconstruction, 3, 121 

algorithm, 121 

lattice, 177 

problem, 38, 155, 305 
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more than two projections, 190 
prescribed, 50 
two projections, 177 
results, 195 
reference 
cylinder, 310 
grain map, 292 
orientation map, 292 
region, 274 
relaxation labeling, 253 
residual energy, 217 
resolution, 248 
restoration algorithm, 281, 287 
RNA (ribonucleic acid), 247 
ROC (receiver operating characteristic) 
curve, 258 
rotation, 277 
composition, 278 
distance, 278 
Ryser algorithm, 117, 175 
generalized, 117 


SA (simulated annealing), 256, 291, 
306, 362 
scanning transmission microscopy, 29 
set theoretic approach, 207 
similarity fraction, 193 
simple cubic lattice, 276 
sinogram, 211 
smoothness, 193, 308 
SPECT (single photon emission 
computed tomography), 334 
start solution, 190 
statistical 
constraint, 213 
model, 248 
stop criterion, 195 
strongly 
linked pattern, 349 
side-connected discrete squares, 348 
subdifferential, 209 
subgradient, 209 
projection, 209 
sum 
broken line, 75, 76 
column, 55, 61 
directed, 68, 70 
line, 55, 61, 62 
parabola, 78 
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row, 55, 59 
switching 
1D, 347 
elementary, 347 
2D, 348 
component, 4, 5 
pattern 
boundary of, 352 
composite, 350 
core of, 352 
elementary, 349 
symmetry 
crystal, 274, 278 


texture material, 279 
thresholded median filtering, 313 
timetabling constraint, 127 
tomographically equivalent sets, 335, 
339 
tomography, 236 
binary, 227, 235, 236, 239 
discrete, 19, 235, 236 
emission, 333 
geometric, 19 
limited view angle, 316 
transmission, 227, 305, 333 


total variation, 208 
transform 

logarithmic, 305 

Radon, 305 
transmission tomography, 305 
transportation polytope, 89 
transposition, 85 


undeformed material, 274, 280 
underdetermined, 181 
uniqueness, 3, 50, 119, 120 
unit quaternion, 277 


vessel system, 236, 238 

void pixel, 299 

Voronoi neighborhood, 249, 251, 252 

voxel, 200 

VRML (virtual reality modeling 
language), 310 


weight map, 181, 192 
weighted reconstruction problem, 181 


Young pattern, 114 


Z-contrast technique, 29 
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